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Description 



BIOINFORMATICALLY DETECTABLE 
GROUP OF NOVEL HIV REGULATORY 
GENES AND USES THEREOF 

Background of Invention 
FIELD OF THE INVENTION 

[0001] The present invention relates to a group of bioinformati- 
cally detectable novel viral RNA regulatory genes, here 
identified as "viral genomic address messenger"or 
"VGAM"genes. 
DESCRIPTION OF PRIOR ART 

[0002] Small RNAs are known to perform diverse cellular func- 
tions, including post-transcriptional gene expression reg- 
ulation. The first two such RNA genes, Lin-4 and Let-7, 
were identified by genetic analysis of Caenorhabditis Ele- 
gans (Elegans) developmental timing, and were termed 
short temporal RNA (stRNA) (Wightman, B., Ha, I., Ruvkun, 
C, Cell 75, 855 (1993); Erdmann, V.A.. et al., Nucleic 



Acids Res. 29, 189 (2001); Lee, R. C, Feinbaum, R. L, 
Ambros, V., Cell 75, 843 (1993); Reinhart, B. et al., Nature 
403, 901 (2000)). 

[0003] Lin-4 and Let-7 each transcribe a -22 nucleotide (nt) 

RNA, which acts a post transcriptional repressor of target 
mRNAs, by binding to elements in the 3"-untranslated re- 
gion (UTR) of these target mRNAs, which are complimen- 
tary to the 22 nt sequence of Lin-4 and Let-7 respectively. 
While Lin-4 and Let-7 are expressed at different develop- 
mental stage, first larval stage and fourth larval stage re- 
spectively, both specify the temporal progression of cell 
fates, by triggering post-transcriptional control over other 
genes (Wightman, B., Ha, I., Ruvkun, C, Cell 75, 855 
(1993); Slack et al., Mol.Cell 5 ,659 (2000)). Let-7 as well 
as its temporal regulation have been demonstrated to be 
conserved in all major groups of bilaterally symmetrical 
animals, from nematodes, through flies to humans 
(Pasquinelli, A., et al. Nature 408 ,86 (2000)). 

[0004] The initial transcription product of Lin-4 and Let-7 is a 

~60-80nt RNA, the nucleotide sequence of the first half of 
which is partially complimentary to that of its second half, 
therefore allowing this RNA to fold onto itself, forming a 
"hairpin structure". The final gene product is a ~22nt RNA, 



which is "diced" from the above mentioned "hairpin struc- 
ture", by an enzyme called Dicer, which also apparently 
also mediates the complimentary binding of this ~22nt 
segment to a binding site in the 3" UTR of its target gene. 
[0005] Recent studies have uncovered 93 new genes in this class, 
now referred to as micro RNA or miRNA genes, in 
genomes of Elegans, Drosophilea, and Human 
(Lagos-Quintana, M., Rauhut, R., Lendeckel, W., Tuschl, 
T., Science 294 ,853 (2001); Lau, N.C., Lim, LP., Wein- 
stein, E.G., Bartel, D.P., Science 294 ,858 (2001); Lee, R.C., 
Ambros, V., Science 294 ,862 (2001). Like the well studied 
Lin-4 and Let-7, all newly found MIR genes produce a 
~60-80nt RNA having a nucleotide sequence capable of 
forming a "hairpin structure". Expressions of the precursor 
~60-80nt RNA and of the resulting diced ~22nt RNA of 
most of these newly discovered MIR genes have been de- 
tected. 

[0006] Based on the striking homology of the newly discovered 
MIR genes to their well-studied predecessors Lin-4 and 
Let-7, the new MIR genes are believed to have a similar 
basic function as that of Lin-4 and Let-7: modulation of 
target genes by complimentary binding to the UTR of 
these target genes, with special emphasis on modulation 



of developmental control processes. This is despite the 
fact that the above mentioned recent studies did not find 
target genes to which the newly discovered MIR genes 
complementarily bind. While existing evidence suggests 
that the number of regulatory RNA genes "may turn out to 
be very large, numbering in the hundreds or even thou- 
sands in each genome", detecting such genes is challeng- 
ing (Ruvkun C, "Perspective: Glimpses of a tiny RNA 
world", Science 294 ,779 (2001)). 
[0007] The ability to detect novel RNA genes is limited by the 

methodologies used to detect such genes. All RNA genes 
identified so far either present a visibly discernable whole 
body phenotype, as do Lin-4 and Let-7 (Wightman et. al., 
Cell 75, 855 (1993); Reinhart et al., Nature 403, 901 
(2000)), or produce significant enough quantities of RNA 
so as to be detected by the standard biochemical genomic 
techniques, as do the 93 recently detected miRNA genes. 
Since a limited number clones were sequenced by the re- 
searchers discovering these genes, 300 by Bartel and 100 
by Tuschl (Bartel et. al., Science 294 ,858 (2001); Tuschl 
et. al., Science 294 ,853 (2001)), the RNA genes found can 
not be much rarer than 1% of all RNA genes. The recently 
detected miRNA genes therefore represent the more 



prevalent among the miRNA gene family. 
[0008] Current methodology has therefore been unable to detect 
RNA genes which either do not present a visually discern- 
able whole body phenotype, or are rare (e.g. rarer than 
0.1% of all RNA genes), and therefore do not produce sig- 
nificant enough quantities of RNA so as to be detected by 
standard biochemical technique.To date, miRNA have not 
been detected in viruses. 
Summary of Invention 

[0009] The present invention relates to a novel group of bioinfor- 
matically detectable, viral regulatory RNA genes, which re- 
press expression of host target host genes, by means of 
complementary hybridization to binding sites in untrans- 
lated regions of these host target host genes. It is be- 
lieved that this novel group of viral genes represent a per- 
vasive viral mechanism of attacking hosts, and that there- 
fore knowledge of this novel group of viral genes may be 
useful in preventing and treating viral diseases. 

[0010] | n various preferred embodiments, the present invention 
seeks to provide improved method and system for detec- 
tion and prevention of viral disease, which is mediated by 
this group of novel viral genes. 

[0011] Accordingly, the invention provides several substantially 



pure nucleic acids (e.g., genomic nucleic acid, cDNA or 
synthetic nucleic acid) each encoding a novel viral gene of 
the VGAM group of gene, vectors comprising the nucleic 
acids , probes comprising the nucleic acids , a method 
and system for selectively modulating translation of 
known "target" genes utilizing the vectors, and a method 
and system for detecting expression of known "target" 
genes utilizing the probe. 
[0012] By "substantially pure nucleic acid" is meant nucleic acid 
that is free of the genes which, in the naturally-occurring 
genome of the organism from which the nucleic acid of 
the invention is derived, flank the genes discovered and 
isolated by the present invention. The term therefore in- 
cludes, for example, a recombinant nucleic acid which is 
incorporated into a vector, into an autonomously replicat- 
ing plasmid or virus, or into the genomic nucleic acid of a 
prokaryote or eukaryote at a site other than its natural 
site; or which exists as a separate molecule (e.g., a cDNA 
or a genomic or cDNA fragment produced by PCR or re- 
striction endonuclease digestion) independent of other 
sequences. It also includes a recombinant nucleic acid 
which is part of a hybrid gene encoding additional 
polypeptide sequence. 



[0013] "inhibiting translation" is defined as the ability to prevent 
synthesis of a specific protein encoded by a respective 
gene, by means of inhibiting the translation of the mRNA 
of this gene. "Translation inhibiter site" is defined as the 
minimal nucleic acid sequence sufficient to inhibit trans- 
lation. 

[0014] There is thus provided in accordance with a preferred em- 
bodiment of the present invention a bioinformatically de- 
tectable novel viral gene encoding substantially pure nu- 
cleic acid wherein: RNA encoded by the bioinformatically 
detectable novel viral gene is about 18 to about 24 nu- 
cleotides in length, and originates from an RNA precursor, 
which RNA precursor is about 50 to about 120 nucleotides 
in length, a nucleotide sequence of a first half of the RNA 
precursor is a partial inversed-reversed sequence of a nu- 
cleotide sequence of a second half thereof, a nucleotide 
sequence of the RNA encoded by the novel viral gene is a 
partial inversed-reversed sequence of a nucleotide se- 
quence of a binding site associated with at least one host 
target gene, and a function of the novel viral gene is 
bioinformatically deducible. 

[0015] There is further provided in accordance with another pre- 
ferred embodiment of the present invention a method for 



anti-viral treatment comprising neutralizing said RNA. 

[0016] Further in accordance with a preferred embodiment of the 
present invention the neutralizing comprises: synthesizing 
a complementary nucleic acid molecule, a nucleic se- 
quence of which complementary nucleic acid molecule is a 
partial inversed-reversed sequence of said RNA, and 
transfecting host cells with the complementary nucleic 
acid molecule, thereby complementarily binding said RNA. 

[0017] Further in accordance with a preferred embodiment of the 
present invention the neutralizing comprises immunologi- 
cally neutralizing. 

[0018] There is still further provided in accordance with another 
preferred embodiment of the present invention a bioinfor- 
matically detectable novel viral gene encoding substan- 
tially pure nucleic acid wherein: RNA encoded by the 
bioinformatically detectable novel viral gene includes a 
plurality of RNA sections, each of the RNA sections being 
about 50 to about 120 nucleotides in length, and includ- 
ing an RNA segment, which RNA segment is about 18 to 
about 24 nucleotides in length, a nucleotide sequence of a 
first half of each of the RNA sections encoded by the novel 
viral gene is a partial inversed-reversed sequence of nu- 
cleotide sequence of a second half thereof, a nucleotide 



sequence of each of the RNA segments encoded by the 
novel viral gene is a partial inversed-reversed sequence of 
the nucleotide sequence of a binding site associated with 
at least one target host gene, and a function of the novel 
viral gene is bioinformatically deducible from the follow- 
ing data elements: the nucleotide sequence of the RNA 
encoded by the novel viral gene, a nucleotide sequence of 
the at least one target host gene, and function of the at 
least one target host gene. 

[0019] Further in accordance with a preferred embodiment of the 
present invention the function of the novel viral gene is 
bioinformatically deducible from the following data ele- 
ments: the nucleotide sequence of the RNA encoded by 
the bioinformatically detectable novel viral gene, a nu- 
cleotide sequence of the at least one target host gene, 
and a function of the at least one target host gene. 

[0020] still further in accordance with a preferred embodiment of 
the present invention the RNA encoded by the novel viral 
gene complementarily binds the binding site associated 
with the at least one target host gene, thereby modulating 
expression of the at least one target host gene. 

[0021] Additionally in accordance with a preferred embodiment 
of the present invention the binding site associated with 



at least one target host gene is located in an untranslated 
region of RNA encoded by the at least one target host 
gene. 

[0022] Moreover in accordance with a preferred embodiment of 
the present invention the function of the novel viral gene 
is selective inhibition of translation of the at least one tar- 
get host gene, which selective inhibition includes comple- 
mentary hybridization of the RNA encoded by the novel 
viral gene to the binding site. 

[0023] Further in accordance with a preferred embodiment of the 
present invention the invention includes a vector including 
the DNA. 

[0024] still further in accordance with a preferred embodiment of 
the present invention the invention includes a method of 
selectively inhibiting translation of at least one gene, in- 
cluding introducing the vector. 

[0025] Moreover in accordance with a preferred embodiment of 
the present invention the introducing includes utilizing 
RNAi pathway. 

[0026] Additionally in accordance with a preferred embodiment 
of the present invention the invention includes a gene ex- 
pression inhibition system including: the vector, and a 
vector inserter, functional to insert the vector into a cell, 



thereby selectively inhibiting translation of at least one 
gene. 

[0027] Further in accordance with a preferred embodiment of the 
present invention the invention includes a probe including 
the DNA. 

[0028] still further in accordance with a preferred embodiment of 
the present invention the invention includes a method of 
selectively detecting expression of at least one gene, in- 
cluding using the probe. 

[0029] Additionally in accordance with a preferred embodiment 
of the present invention the invention includes a gene ex- 
pression detection system including: the probe, and a 
gene expression detector functional to selectively detect 
expression of at least one gene. 

[0030] Further in accordance with a preferred embodiment of the 
present invention the invention includes an anti-viral sub- 
stance capable of neutralizing the RNA. 

[0031] still further in accordance with a preferred embodiment of 
the present invention the neutralizing includes comple- 
mentarily binding the RNA. 

[0032] Additionally in accordance with a preferred embodiment 
of the present invention the neutralizing includes im- 
munologically neutralizing. 



[0033] Moreover in accordance with a preferred embodiment of 
the present invention the invention includes a method for 
anti-viral treatment including neutralizing the RNA. 

[0034] Further in accordance with a preferred embodiment of the 
present invention the neutralizing includes: synthesizing a 
complementary nucleic acid molecule, a nucleic sequence 
of which complementary nucleic acid molecule is a partial 
inversed-reversed sequence of the RNA, and transfecting 
host cells with the complementary nucleic acid molecule, 
thereby complementarily binding the RNA. 

[0035] still further in accordance with a preferred embodiment of 
the present invention the neutralizing includes immuno- 
logically neutralizing. 
Brief Description of Drawings 

[0036] Fig. 1 is a simplified diagram illustrating a mode by which 
viral genes of a novel group of viral genes of the present 
invention, modulate expression of known host target 
genes; 

[0037] Fig. 2 is a simplified block diagram illustrating a bioinfor- 
matic gene detection system capable of detecting genes 
of the novel group of genes of the present invention, 
which system is constructed and operative in accordance 
with a preferred embodiment of the present invention; 



[0038] pig. 3 is a simplified flowchart illustrating operation of a 
mechanism for training of a computer system to recog- 
nize the novel genes of the present invention, which 
mechanism is constructed and operative in accordance 
with a preferred embodiment of the present invention; 

[0039] pig. 4A is a simplified block diagram of a non-coding ge- 
nomic sequence detector constructed and operative in ac- 
cordance with a preferred embodiment of the present in- 
vention; 

[0040] Fig. 4B is a simplified flowchart illustrating operation of a 
non-coding genomic sequence detector constructed and 
operative in accordance with a preferred embodiment of 
the present invention; 

[0041] Fig. 5A is a simplified block diagram of a hairpin detector 
constructed and operative in accordance with a preferred 
embodiment of the present invention; 

[0042] Fig. 5B is a simplified flowchart illustrating operation of a 
hairpin detector constructed and operative in accordance 
with a preferred embodiment of the present invention; 

[0043] Fig. 6A is a simplified block diagram of a dicer-cut loca- 
tion detector constructed and operative in accordance 
with a preferred embodiment of the present invention; 

[0044] Fig. 6B is a simplified flowchart illustrating training of a 



dicer-cut location detector constructed and operative in 
accordance with a preferred embodiment of the present 
invention; 

[0045] pig. 7 A is a simplified block diagram of a target-gene 

binding-site detector constructed and operative in accor- 
dance with a preferred embodiment of the present inven- 
tion; 

[0046] pig. 7B is a simplified flowchart illustrating operation of a 
target-gene binding-site detector constructed and opera- 
tive in accordance with a preferred embodiment of the 
present invention; 

[0047] Fig. 8 is a simplified flowchart illustrating operation of a 
function & utility analyzer constructed and operative in 
accordance with a preferred embodiment of the present 
invention; 

[0048] Fig. 9 is a simplified diagram describing a novel bioinfor- 
matically detected group of regulatory genes, referred to 
here as Genomic Record (CR) genes, each of which en- 
codes an "operon-like" cluster of novel miRNA-like genes, 
which in turn modulates expression of a plurality of target 
genes; 

[0049] Fig. 10 is a block diagram illustrating different utilities of 
genes of a novel group of genes, and operons of a novel 



group of operons, both of the present invention; 
[0050] pigs. 11A and 11B are simplified diagrams, which when 
taken together illustrate a mode of gene therapy applica- 
ble to genes of the novel group of genes of the present 
invention; 

[0051] pig. 12A is an annotated sequence of EST72223 compris- 
ing novel gene GAM24 detected by the gene detection 
system of the present invention; 

[0052] Figs. 12B and 12C are pictures of laboratory results, which 
when taken together demonstrate laboratory confirmation 
of expression of the bioinformatically detected novel gene 
CAM24 of Fig. 12A; 

[0053] Fig. 12D provides pictures of laboratory results, which 

when taken together demonstrate further laboratory con- 
firmation of expression of the bioinformatically detected 
novel gene GAM24 of Fig. 12A; 

[0054] Fig. 13A is an annotated sequence of an EST7929020 

comprising novel genes CAM23 and CAM25 detected by 
the gene detection system of the present invention; 

[0055] Fig. 13B is a picture of laboratory results, which confirm 
expression of bioinformatically detected novel genes 
GAM23 and GAM25 of Fig. 13A; 

[0056] Fig. 13C is a picture of laboratory results, which confirm 



endogenous expression of bioinformatically detected 
novel gene GAM25 of Fig. 15A; 

[0057] pig. 14A is an annotated sequence of an EST1388749 

comprising novel gene GAM26 detected by the gene de- 
tection system of the present invention; 

[0058] pigs. 14B is a picture of laboratory results, which confirm 
expression of the bioinformatically detected novel gene 
GAM26 of Fig. 14A; 

[0059] Figs. 15A through 29D are schematic diagrams illustrating 
sequences, functions and utilities of 15 specific viral 
genes of the novel group of viral regulatory genes of the 
present invention, detected using the bioinformatic gene 
detection system described hereinabove with reference to 
Figs. 1 through 8; and 

[0060] Figs. 30 through 31 are schematic diagrams illustrating 
sequences, functions and utilities of 2 specific viral genes 
of a group of novel regulatory "operon-like" viral genes of 
the present invention, detected using the bioinformatic 
gene detection system described hereinabove with refer- 
ence to Figs. 9 through 14. 
Brief Description of Sequences 

[0061] a Sequence Listing of genomic sequences of the present 
invention designated SEQ ID:1 through SEQ ID:406 is at- 



tached to this application, enclosed in computer readable 
form on CD-ROM. The genomic listing comprises the fol- 
lowing nucleotide sequences:Cenomic sequences desig- 
nated SEQ ID:1 through SEQ ID: 15 are nucleotide se- 
quences of 15 gene precursors of respective novel genes 
of the present invention;Genomic sequences designated 
SEQ ID: 16 through SEQ ID:30 are nucleotide sequences of 
15 genes of the present invention; andGenomic sequences 
designated SEQ ID:31 through SEQ ID:406 are nucleotide 
sequences of 376 gene precursors of respective novel 
genes of the present invention. 
Detailed Description 

[0062] Reference is now made to Fig. 1 which is a simplified dia- 
gram illustrating a mode by which genes of a novel group 
of genes of the present invention, modulate expression of 
known host target. 

[0063] The novel genes of the present invention are micro RNA 
(miRNA)-like, regulatory RNA genes, modulating expres- 
sion of known host target. This mode of modulation is 
common to other known miRNA genes, as described here- 
inabove with reference to the background of the invention 
section. 

[0064] VGAM GENE and TARGET GENE are two human genes con- 



tained in the DNA of the human genome. 
[0065] VGAM GENE encodes a VGAM PRECURSOR RNA. However, 
similar to other miRNA genes, and unlike most ordinary 
genes, its RNA, VGAM PRECURSOR RNA, does not encode a 
protein. 

[0066] VGAM PRECURSOR RNA folds onto itself, forming VGAM 
FOLDED PRECURSOR RNA. As Fig.8 illustrates, VGAM 
FOLDED PRECURSOR RNA forms a "hairpin structure", 
folding onto itself. As is well known in the art, this "hair- 
pin structure", is typical genes of the miRNA genes, and is 
due to the fact that nucleotide sequence of the first half of 
the RNA of a gene in this group is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
its second half. By "inversed-reversed"is meant a se- 
quence which is reversed and wherein each nucleotide is 
replaced by a complimentary nucleotide, as is well known 
in the art (e.g. ATGGC is the inversed-reversed sequence 
of GCCAT). 

[0067] An enzyme complex, designated DICER COMPLEX, "dices" 
the VGAM FOLDED PRECURSOR RNA into a single stranded 
RNA segment, about 22 nucleotides long, designated 
VGAM RNA. As is known in the art, "dicing" of the hairpin 
structured RNA precursor into shorter RNA segments 



about 22 nucleotides long by a Dicer type enzyme is cat- 
alyzed by an enzyme complex comprising an enzyme 
called Dicer together with other necessary proteins. 

[0068] TARGET GENE encodes a corresponding messenger RNA, 
designated TARGET RNA. This TARGET RNA comprises 3 
regions: a 5" untranslated region, a protein coding region 
and a 3" untranslated region, designated 5"UTR, PROTEIN 
CODING and 3"UTR respectively. 

[0069] VGAM RNA binds complementarily a BINDING SITE, located 
on the 3"UTR segment of TARGET RNA. This complemen- 
tarily binding is due to the fact that the nucleotide se- 
quence of VGAM RNA is an accurate or partial inversed- 
reversed sequence of the nucleotide sequence of BINDING 
SITE. 

[0070] T he complimentary binding of VGAM RNA to BINDING SITE 
inhibits translation of TARGET RNA into TARGET PROTEIN. 
TARGET PROTEIN is therefore outlined by a broken line. 

[0071] | t j S appreciated by one skilled in the art that the mode of 
transcriptional inhibition illustrated by Fig. 1 with specific 
reference to VGAM genes of the present invention, is in 
fact common to all other miRNA genes. A specific compli- 
mentary binding site has been demonstrated only for Lin- 
4 and Let-7. All the other 93 newly discovered miRNA 



genes are also believed by those skilled in the art to mod- 
ulate expression of other genes by complimentary bind- 
ing, although specific complimentary binding sites for 
these genes have not yet been found (Ruvkun C, "Perspec- 
tive: Glimpses of a tiny RNA world", Science 294 ,779 
(2001)). The present invention discloses a novel group of 
genes, the VGAM genes, belonging to the miRNA genes 
group, and for which a specific an complimentary binding 
has been determined. 

[0072] Reference is now made to Fig. 2 which is a simplified 

block diagram illustrating a bioinformatic gene detection 
system capable of detecting genes of the novel group of 
genes of the present invention, which system is con- 
structed and operative in accordance with a preferred em- 
bodiment of the present invention. 

[0073] a centerpiece of the present invention is a bioinformatic 
gene detection engine 100, which is a preferred imple- 
mentation of a mechanism capable of bioinformatically 
detecting genes of the novel group of genes of the 
present invention. 

[0074] The function of the bioinformatic gene detection engine 
100 is as follows: it receives three types of input, ex- 
pressed RNA data 102, sequenced DNA data 104, and 



protein function data 106, performs a complex process of 
analysis of this data as elaborated below, and based on 
this analysis produces output of a bioinformatically de- 
tected group of novel genes designated 108. 
[0075] Expressed RNA data 102 comprises published expressed 
sequence tags (EST) data, , published mRNA data, as well 
as other sources of published RNA data. Sequenced DNA 
data 104 comprises alphanumeric data describing se- 
quenced genomic data, which preferably includes annota- 
tion data such as location of known protein coding re- 
gions relative to the sequenced data. Protein function data 
106 comprises scientific publications reporting studies 
which elucidated physiological function known proteins, 
and their connection, involvement and possible utility in 
treatment and diagnosis of various diseases. Expressed 
RNA data 102, sequenced DNA data 104 may preferably 
be obtained from data published by the National Center 
for Bioinformatics (NCBI) at the National Institute of Health 
(NIH), as well as from various other published data 
sources. Protein function data 106 may preferably be ob- 
tained from any one of numerous relevant published data 
sources, such as the Online Mendelian Inherited Disease 
In Man (OMIM) database developed by John Hopkins Uni- 



versity, and also published by NCBI. 

[0076] p r j or to actual detection of bioinformatically detected 

novel genes 108 by the bioinformatic gene detection en- 
gine 100, a process of bioinformatic gene detection en- 
gine training & validation designated 110 takes place. 
This process uses the known miRNA genes as a training 
set (some 200 such genes have been found to date using 
biological laboratory means), to train the bioinformatic 
gene detection engine 100 to bioinformatically recognize 
miRNA-like genes, and their respective potential target 
binding sites. Bioinformatic gene detection engine training 
& validation 110 is further describe hereinbelow with ref- 
erence to Fig. 3. 

[0077] The bioinformatic gene detection engine 100 comprises 
several modules which are preferably activated sequen- 
tially, and are described as follows: 

[0078] a non-coding genomic sequence detector 112 operative 
to bioinformatically detect non-protein coding genomic 
sequences. The non-coding genomic sequence detector 
112 is further described hereinbelow with reference to 
Figs. 4A and 4B. 

[0079] a hairpin detector 114 operative to bioinformatically de- 
tect genomic "hairpin-shaped" sequences, similar to 



VGAM FOLDED PRECURSOR of Fig. 1. The hairpin detector 
114 is further described hereinbelow with reference to 
Figs. 5A and 5B. 

[0080] a dicer-cut location detector 116 operative to bioinfor- 

matically detect the location on a hairpin shaped sequence 
which is enzymatically cut by DICER COMPLEX of Fig. 1. 
The dicer-cut location detector 116 is further described 
hereinbelow with reference to Fig. 6A. 

[0081] a target-gene binding-site detector 118 operative to 

bioinformatically detect host target having binding sites, 
the nucleotide sequence of which is partially complemen- 
tary to that of a given genomic sequence, such as a se- 
quence cut by DICER COMPLEX of Fig. 1. The target-gene 
binding-site detector 118 is further described hereinbe- 
low with reference to Figs. 7 A and 7B. 

[0082] a function & utility analyzer 120 operative to analyze 

function and utility of host target, in order to identify host 
target which have a significant clinical function and utility. 
The function & utility analyzer 120 is further described 
hereinbelow with reference to Fig. 8. 

[0083] Hardware implementation of the bioinformatic gene de- 
tection engine 100 is important, since significant comput- 
ing power is preferably required in order to perform the 



computation of bioinformatic gene detection engine 100 
in reasonable time and cost. As an example, it is esti- 
mated that using one powerful 8-processor PC Server, 
over 30 months of computing time (at 24 hours per day) 
would be required in order to detect all miRNA genes in 
human EST data, and their respective binding sites. 
[0084] For example, in order to address this challenge at reason- 
able time and cost, a preferred embodiment of the 
present invention may comprise a cluster of a large num- 
ber of personal computers (PCs), such as 100 PCs 
(Pentium IV, 1.7GHz, with 40GB storage each), connected 
by Ethernet to several strong servers, such as 4 servers 
(2-CPU, Xeon 2.2GHz, with 200GB storage each), com- 
bined with an 8-processor server (8-CPU, Xeon 550Mhz 
w/ 8GB RAM) connected via 2 HBA fiber-channels to an 
EMC Clariion 100-disks, 3.6 Terabyte storage device. Ad- 
ditionally, preferably an efficient database computer pro- 
gram, such as Microsoft (TM) SQL-Server database com- 
puter program is used and is optimized to the specific re- 
quirements of bioinformatic gene detection engine 100. 
Furthermore, the PCs are preferably optimized to operate 
close to 100% CPU usage continuously, as is known in the 
art. Using suitable hardware and software may preferably 



reduce the required calculation time in the abovemen- 
tioned example from 30 months to 20 days. 
[0085] it is appreciated that the abovementioned hardware con- 
figuration is not meant to be limiting, and is given as an 
illustration only. The present invention may be imple- 
mented in a wide variety of hardware and software config- 
urations. 

[0086] The present invention discloses 15 novel viral genes of 
the VGAM group of genes, which have been detected 
bioinformatically, as described hereinbelow with reference 
to Figs. 15 through 29. Laboratory confirmation of 4 
genes of the GAM group of genes is described hereinbe- 
low with reference to Figs. 12 through 14. 

[0087] Reference is now made to Fig. 3 which is a simplified 

flowchart illustrating operation of a mechanism for train- 
ing of a computer system to recognize the novel genes of 
the present invention. This mechanism is a preferred im- 
plementation of the bioinformatic gene detection engine 
training & validation 110 described hereinabove with ref- 
erence to Fig. 2. 

[0088] Bioinformatic gene detection engine training & validation 
110 of Fig. 2 begins by training the bioinformatic gene 
detection engine to recognize known miRNA genes, as 



designated by numeral 122. This training step comprises 
hairpin detector training & validation 124, further de- 
scribed hereinbelow with reference to Fig. 12 A, dicer-cut 
location detector training & validation 126, further de- 
scribed hereinbelow with reference to Fig. 6A and 6B, and 
target-gene binding-site detector training & validation 
128, further described hereinbelow with reference to Fig. 
7A. 

[0089] Next, the bioinformatic gene detection engine 100 is used 
to bioinformatically detect sample novel genes, as desig- 
nated by numeral 130. An example of a sample novel 
gene thus detected is described hereinbelow with refer- 
ence to Fig. 12. 

[0090] Finally, wet lab experiments are preferably conducted in 
order to validate expression and preferably function the 
sample novel genes detected by the bioinformatic gene 
detection engine 100 in the previous step. An example of 
wet-lab validation of the abovementioned sample novel 
gene bioinformatically detected by the system is de- 
scribed hereinbelow with reference to Figs. 13Aand 13B. 

[0091] Reference is now made to Fig. 4A which is a simplified 

block diagram of a preferred implementation of the non- 
coding genomic sequence detector 112 described herein- 



above with reference to Fig. 2. Non-protein coding ge- 
nomic sequence detector 112 of Fig. 2 preferably receives 
as input at least two types of published genomic data: ex- 
pressed RNA data 102, including EST data and mRNA 
data, and sequenced DNA data 104. After its initial train- 
ing, indicated by numeral 134, and based on the above- 
mentioned input data, the non-protein coding genomic 
sequence detector 112 produces as output a plurality of 
non-protein coding genomic sequences 136. Preferred 
operation of the non-protein coding genomic sequence 
detector 112 is described hereinbelow with reference to 
Fig. 4B. 

[0092] Reference is now made to Fig. 4B which is a simplified 
flowchart illustrating a preferred operation of the non- 
coding genomic sequence detector 112 of Fig. 2. Detec- 
tion of non-protein coding genomic sequences to be fur- 
ther analyzed by the system generally preferably pro- 
gresses in one of the following two paths. 

[0093] a first path for detecting non-protein coding genomic se- 
quences begins by receiving a plurality of known RNA se- 
quences, such as EST data. Each RNA sequence is first 
compared to all known protein-coding sequences, in or- 
der to select only those RNA sequences which are non- 



protein coding. This can preferably be performed by 
BLAST comparison of the RNA sequence to known protein 
coding sequences. The abovementioned BLAST compari- 
son to the DNA preferably also provides the localization of 
the RNA on the DNA. 

[0094] Optionally, an attempt may be made to "expand" the non- 
protein RNA sequences thus found, by searching for tran- 
scription start and end signals, upstream and downstream 
of location of the RNA on the DNA respectively, as is well 
known in the art. 

[0095] a second path for detecting non-protein coding genomic 
sequences starts by receiving DNA sequences. The DNA 
sequences are parsed into non protein coding sequences, 
based on published DNA annotation data: extracting those 
DNA sequences which are between known protein coding 
sequences. Next, transcription start and end signals are 
sought. If such signals are found, and depending on their 
"strength", probable expressed non-protein coding ge- 
nomic sequences are yielded. 

[0096] Reference is now made to Fig. 5A which is a simplified 

block diagram of a preferred implementation of the hair- 
pin detector 114 described hereinabove with reference to 
Fig. 2. 



[0097] The goal of the hairpin detector 114 is to detect "hairpin" 
shaped genomic sequences, similar to those of known 
miRNA genes. As mentioned hereinabove with reference 
to Fig. 1, a "hairpin" genomic sequence refers to a ge- 
nomic sequence which "folds onto itself forming a hairpin 
like shape, due to the fact that nucleotide sequence of the 
first half of the nucleotide sequence is an accurate or 

[0098] The hairpin detector 114 of Fig. 2 receives as input a plu- 
rality of non-protein coding genomic sequences 136 of 
Fig. 4A, and after a phase of hairpin detector training & 
validation 124 of Fig. 3, is operative to detect and output 
"hairpin shaped" sequences found in the input expressed 
non-protein coding sequences, designated by numeral 
138. 

[0099] The phase of hairpin detector training & validation 124 is 
an iterative process of applying the hairpin detector 114 
to known hairpin shaped miRNA genes, calibrating the 
hairpin detector 114 such that it identifies the training set 
of known hairpins, as well as sequences which are similar 
thereto. Preferred operation of the hairpin detector 114 is 
described hereinbelow with reference to Fig. 5B. 

[0100] Reference is now made to Fig. 5B which is a simplified 

flowchart illustrating a preferred operation of the hairpin 



detector 114 of Fig. 2. 

[0101] a hairpin structure is a two dimensional folding structure, 
resulting from the nucleotide sequence pattern: the nu- 
cleotide sequence of the first half of the hairpin sequence 
is an inversed-reversed sequence of the second half 
thereof. Different methodologies are known in the art for 
detection of various two dimensional and three dimen- 
sional hairpin structures. 

[0102] | n a preferred embodiment of the present invention, the 
hairpin detector 114 initially calculates possible 
2-dimensional (2D) folding patterns of a given one of the 
non-protein coding genomic sequences 136, preferably 
using a 2D folding algorithm based on free-energy calcu- 
lation, such as the Zucker algorithm, as is well known in 
the art. 

[0103] Next, the hairpin detector 114 analyzes the results of the 
2D folding, in order to determine the presence, and loca- 
tion of hairpin structures. A 2D folding algorithm typically 
provides as output a listing of the base-pairing of the 2D 
folded shape, i.e. a listing of which all two pairs of nu- 
cleotides in the sequence which will bond. The goal of this 
second step, is to asses this base-pairing listing, in order 
to determine if it describes a hairpin type bonding pat- 



tern. 

[0104] The hairpin detector 114 then assess those hairpin struc- 
tures found by the previous step, comparing them to hair- 
pins of known miRNA genes, using various parameters 
such as length, free-energy, amount and type of mis- 
matches, etc. Only hairpins that bear statistically signifi- 
cant resemblance of the population of hairpins of known 
miRNAs, according to the abovementioned parameters are 
accepted. 

[0105] Lastly, the hairpin detector 114 attempts to select those 
hairpin structures which are as stable as the hairpins of 
know miRNA genes. This may be achieved in various man- 
ners. A preferred embodiment of the present invention 
utilizes the following methodology comprising three 
steps: 

[0106] First, the hairpin detector 114 attempts to group potential 
hairpins into "families" of closely related hairpins. As is 
known in the art, a free-energy calculation algorithm, typ- 
ically provides multiple "versions" each describing a dif- 
ferent possible 2D folding pattern for the given genomic 
sequence, and the free energy of such possible folding. 
The hairpin detector 114 therefore preferably assesses all 
hairpins found on all "versions", grouping hairpins which 



appear in different versions, but which share near identi- 
cal locations into a common "family" of hairpins. For ex- 
ample, all hairpins in different versions, the center of 
which is within 7 nucleotides of each other may preferably 
be grouped to a single "family". 

[0107] Next, hairpin "families" are assessed, in order to select 
only those families which represent hairpins that are as 
stable as those of known miRNA hairpins. For example, 
preferably only families which are represented in at least 
65% of the free-energy calculation 2D folding versions, 
are considered stable. 

[0108] Finally, an attempt is made to select the most suitable 

hairpin from each selected family. For example, preferably 
the hairpin which appears in more versions than other 
hairpins, and in versions the free-energy of which is 
lower, may be selected. 

[0109] Reference is now made to Fig. 6A which is a simplified 

block diagram of a preferred implementation of the dicer- 
cut location detector 116 described hereinabove with ref- 
erence to Fig. 2. 

[0110] The goal of the dicer-cut location detector 116 is to de- 
tect the location in which DICER COMPLEX of Fig. 1, com- 
prising the enzyme Dicer, would "dice" the given hairpin 



sequence, similar to VGAM FOLDED PRECURSOR RNA, 
yielding VGAM RNA both of Fig. 1. 

The dicer-cut location detector 116 of Fig. 2 therefore re- 
ceives as input a plurality of hairpins on genomic se- 
quences 138 of Fig. 5A, which were calculated by the pre- 
vious step, and after a phase of dicer-cut location detec- 
tor training & validation 126 of Fig. 3, is operative to de- 
tect a respective plurality of dicer-cut sequences from 
hairpins 140, one for each hairpin. 

[° 112 ] In a preferred embodiment of the present invention, the 
dicer-cut location detector 116 preferably uses a combi- 
nation of neural networks, Bayesian networks, Markovian 
modeling, and Support Vector Machines (SVMs) trained on 
the known dicer-cut locations of known miRNA genes, in 
order to detect dicer-cut locations. Dicer-cut location de- 
tector training & validation 126, which is further described 
hereinbelow with reference to Fig. 6B. 

[0113] Reference is now made to Fig. 6 B which is a simplified 

flowchart illustrating a preferred implementation of dicer- 
cut location detector training & validation 126 of Fig. 3. 
Dicer-cut location detector 116 first preprocesses known 
miRNA hairpins and their respective dicer-cut locations, 
so as to be able to properly analyze them and train the 



detection system accordingly: 

[0114] The folding pattern is calculated for each known miRNA, 
preferably based on free-energy calculation, and the size 
of the hairpin, the size of the loop at the center of the 
hairpin, and "bulges" (i.e. mismatched base-pairs) in the 
folded hairpin are noted. 

[0115] The dicer-cut location, which is known for known miRNA 
genes, is noted relative to the above, as well as to the nu- 
cleotides in each location along the hairpin. Frequency of 
identity of nucleotides, and nucleotide-pairing, relative to 
their location in the hairpin, and relative to the known 
dicer-cut location in the known miRNA genes is analyzed 
and modeled. 

[0116] Different techniques are well known in the art for analysis 
of existing pattern from a given "training set" of species 
belonging to a genus, which techniques are then capable, 
to a certain degree, to detect similar patterns in other 
species not belonging to the training-set genus. Such 
techniques include, but are not limited to neural net- 
works, Bayesian networks, Support Vector Machines (SVM), 
Genetic Algorithms, Markovian modeling, and others, as is 
well known in the art. 

[O 117 ] Using such techniques, preferably a combination of sev- 



eral of the above techniques, the known hairpins are rep- 
resented as a several different networks (such as neural, 
Bayesian, or SVM) input and output layers. Both nu- 
cleotide, and "bulge" (i.e. nucleotide pairing or mismatch) 
are represented for each position in the hairpin, at the in- 
put layer, and a corresponding true/false flag at each po- 
sition, indicating whether it was diced by dicer at the out- 
put layer. Multiple networks are preferably used concur- 
rently, and the results therefrom are integrated and fur- 
ther optimized. Markovian modeling may also be used to 
validate the results and enhance their accuracy. Finally, 
the bioinformatic detection of dicer-cut location of a 
sample novel is confirmed by wet-lab experimentation. 
18 ] Reference is now made to Fig. 7A which is a simplified 
block diagram of a preferred implementation of the tar- 
get-gene binding-site detector 118 described herein- 
above with reference to Fig. 2. The goal of the target- 
gene binding-site detector 118 is to detect a BINDING 
SITE of Fig. 1, located in an untranslated region of the 
RNA of a known gene, the nucleotide sequence of which 
BINDING SITE is at least partially complementary to that of 
a VGAM RNA of Fig. 1, thereby determining that the 
abovementioned known gene is a target gene of VGAM of 



Fig. 1. 

[0119] The target-gene binding-site detector 118 of Fig. 2 
therefore receives as input a plurality of dicer-cut se- 
quences from hairpins 140 of Fig. 6A which were calcu- 
lated by the previous step, and a plurality of potential tar- 
get gene sequences 142 which derive sequence DNA data 
104 of Fig. 2, and after a phase of target-gene binding- 
site detector training & validation 128 of Fig. 3, is opera- 
tive to detect target-genes having binding site/s 144 the 
nucleotide sequence of which is at least partially comple- 
mentary to that of each of the plurality of dicer-cut se- 
quences from hairpins 140. Preferred operation of the 
target-gene binding-site detector is further described 
hereinbelow with reference to Fig. 7B. 

[0120] Reference is now made to Fig. 7B which is a simplified 

flowchart illustrating a preferred operation of the target- 
gene binding-site detector 118 of Fig. 2. In a preferred 
embodiment of the present invention, the target-gene 
binding-site detector 118 first performs a BLAST compar- 
ison of the nucleotide sequence of each of the plurality of 
dicer-cut sequences from hairpins 140, to the potential 
target gene sequences 142, in order to find crude poten- 
tial matches. Blast results are then filtered to results which 



are similar to those of known binding sites (e.g. binding 
sites of miRNA genes Lin-4 and Let-7 to target genes Lin- 
14, Lin-41, Lin 28 etc.). Next the binding site is ex- 
panded, checking if nucleotide sequenced immediately 
adjacent to the binding site found by BLAST, may improve 
the match. Suitable binding sites, then are computed for 
free-energy and spatial structure. The results are ana- 
lyzed, selecting only those binding sites, which have free- 
energy and spatial structure similar to that of known 
binding sites. 

[0121] Reference is now made to Fig. 8 which is a simplified 

flowchart illustrating a preferred operation of the function 
& utility analyzer 120 described hereinabove with refer- 
ence to Fig. 2. The goal of the function & utility analyzer 
120 is to determine if a potential target gene is in fact a 
valid clinically useful target gene. Since a potential novel 
VGAM gene binding a binding site in the UTR of a target 
gene is understood to inhibit expression of that target 
gene, and if that target gene is shown to have a valid clin- 
ical utility, then in such a case it follows that the potential 
novel gene itself also has a valid useful function which is 
the opposite of that of the target gene. 

[0122] The function & utility analyzer 120 preferably receives as 



input a plurality of potential novel target genes having 
binding-site/s 144, generated by the target-gene bind- 
ing-site detector 118, both of Fig. 7A. Each potential 
gene, is evaluated as follows: 
[0123] First the system first checks to see if the function of the 
potential target gene is scientifically well established. 
Preferably, this can be achieved bioinformatically by 
searching various published data sources presenting in- 
formation on known function of proteins. Many such data 
sources exist and are published as is well known in the 
art. 

[0124] Next, for those target genes the function of which is sci- 
entifically known and is well documented, the system then 
checks if scientific research data exists which links them 
to known diseases. For example, a preferred embodiment 
of the present invention utilizes the OMIM(TM) database 
published by NCBI, which summarizes research publica- 
tions relating to genes which have been shown to be as- 
sociated with diseases. 

[0125] Finally, the specific possible utility of the target gene is 
evaluated. While this process too may be facilitated by 
bioinformatic means, it might require human evaluation of 
published scientific research regarding the target gene, in 



order to determine the utility of the target gene to the di- 
agnosis and or treatment of specific disease. Only poten- 
tial novel genes, the target-genes of which have passed 
all three examinations, are accepted as novel genes. 

[0126] Reference is now made to Fig. 9, which is a simplified dia- 
gram describing a novel bioinformatically detected group 
of regulatory genes, referred to here as Genomic Record 
(GR) genes, that encode an "operon-like" cluster of novel 
miRNA-like genes, each modulating expression of a plu- 
rality of host target, the function and utility of which tar- 
get genes is known. 

[0127] gr GENE (Genomic Record Gene) is gene of a novel, bioin- 
formatically detected group of regulatory, non protein 
coding, RNA genes. The method by which GR is detected 
is described hereinabove with reference to FIGS. 6-15. 

[0128] gr GENE encodes an RNA molecule, typically several hun- 
dred nucleotides long, designated GR PRECURSOR RNA. 

[0129] GR PRECURSOR RNA folds spatially, as illustrated by GR 

FOLDED PRECURSOR RNA, into a plurality of what is known 
in the art as "hair-pin" structures. The nucleotide se- 
quence of GR PRECURSOR RNA comprises a plurality of 
segments, the first half of each such segment having a 
nucleotide sequence which is at least a partial inversed- 



reversed sequence of the second half thereof, thereby 
causing formation of a plurality of "hairpin" structures, as 
is well known in the art. 

[0 1 30] GR FOLDED PRECURSOR RNA is naturally processed by 

cellular enzymatic activity, into 3 separate hairpin shaped 
RNA segments, each corresponding to VGAM PRECURSOR 
RNA of Fig. 1, designated VGAM1 PRECURSOR, VGAM 2 
PRECURSOR and VGAM 3 PRECURSOR respectively. 

[0131] The above mentioned VGAM precursors, are diced by 

Dicer of FIG. 1, yielding short RNA segments of about 22 
nucleotides in length, each corresponding to VGAM RNA 
of FIG. 1, designated VGAM1, VGAM 2 and VGAM 3 respec- 
tively. 

[0132] VGAM1, VGAM 2 and VGAM 3 each bind complementarily to 
binding sites located in untranslated regions of respective 
host target, designated VGAM 1 -TARGET RNA, 
VGAM 2 -TARG ET RNA and VGAM 3 -TARG ET RNA respec- 
tively. This binding inhibits translation of the respective 
target proteins designated VGAM 1 -TARGET PROTEIN, 
VGAM 2 -TARGET PROTEIN and VGAM 3 -TARGET PROTEIN 
respectively. 

[0133] The structure of VGAM genes comprised in a GR GENE, 
and their mode of modulation of expression of their re- 



spective target genes is described hereinabove with refer- 
ence to Fig. 1. The bioinformatic approach to detection of 
VGAM genes comprised in a GR GENE is described herein- 
above with reference to Figs. 9 through 14. 

[0134] The present invention discloses 17 novel viral genes of 
the GR group of genes, which have been detected bioin- 
formatically, as described hereinbelow with reference to 
Figs. 15 through 31. Laboratory confirmation of 3 genes 
of the GR group of genes is described hereinbelow with 
reference to Figs. 9A through 14. 

[0135] | n summary, the current invention discloses a very large 
number of novel viral GR genes, each of which encodes a 
plurality of VGAM genes, which in turn may modulate ex- 
pression of a plurality of host target proteins. 

[0136] Reference is now made to Fig. 10 which is a block diagram 
illustrating different utilities of genes of the novel group 
of genes of the present invention referred to here as 
VGAM genes and GR genes. 

[0137] The present invention discloses a first plurality of novel 

genes referred to here as VGAM genes, and a second plu- 
rality of operon-like genes referred to here as GR genes, 
each of the GR genes encoding a plurality of VGAM genes. 
The present invention further discloses a very large num- 



ber of known target-genes, which are bound by, and the 
expression of which is modulated by each of the novel 
genes of the present invention. Published scientific data 
referenced by the present invention provides specific, 
substantial, and credible evidence that the abovemen- 
tioned target genes modulated by novel genes of the 
present invention, are associated with various diseases. 
Specific novel genes of the present invention, target genes 
thereof and diseases associated therewith, are described 
hereinbelow with reference to Figs. 15 through 29 It is 
therefore appreciated that a function of VGAM genes and 
GR genes of the present invention is modulation of ex- 
pression of target genes related to known diseases, and 
that therefore utilities of novel genes of the present in- 
vention include diagnosis and treatment of the above- 
mentioned diseases. Fig. 10 describes various types of di- 
agnostic and therapeutic utilities of novel genes of the 
present invention. 
[0138] a utility of novel genes of the present invention is detec- 
tion of VGAM genes and of GR genes. It is appreciated 
that since VGAM genes and GR genes modulate expres- 
sion of disease related target genes, that detection of ex- 
pression of VGAM genes in clinical scenarios associated 



with said diseases is a specific, substantial and credible 
utility. Diagnosis of novel genes of the present invention 
may preferably be implemented by RNA expression detec- 
tion techniques, including but not limited to biochips, as 
is well known in the art. Diagnosis of expression of genes 
of the present invention may be useful for research pur- 
poses, in order to further understand the connection be- 
tween the novel genes of the present invention and the 
abovementioned related diseases, for disease diagnosis 
and prevention purposes, and for monitoring disease 
progress. 

[0139] Another utility of novel genes of the present invention is 
anti-VGAM gene therapy, a mode of therapy which allows 
up regulation of a disease related target-gene of a novel 
VGAM gene of the present invention, by lowering levels of 
the novel VGAM gene which naturally inhibits expression 
of that target gene. This mode of therapy is particularly 
useful with respect to target genes which have been 
shown to be under-expressed in association with a spe- 
cific disease. Anti-VGAM gene therapy is further discussed 
hereinbelow with reference to Figs. llAand 11B. 

[0140] a further utility of novel genes of the present invention is 
VGAM replacement therapy, a mode of therapy which 



achieves down regulation of a disease related target-gene 
of a novel VGAM gene of the present invention, by raising 
levels of the VGAM gene which naturally inhibits expres- 
sion of that target gene. This mode of therapy is particu- 
larly useful with respect to target genes which have been 
shown to be over-expressed in association with a specific 
disease. VGAM replacement therapy involves introduction 
of supplementary VGAM gene products into a cell, or 
stimulation of a cell to produce excess VGAM gene prod- 
ucts. VGAM replacement therapy may preferably be 
achieved by transfecting cells with an artificial DNA 
molecule encoding a VGAM gene, which causes the cells 
to produce the VGAM gene product, as is well known in 
the art. 

[0141] yet a further utility of novel genes of the present invention 
is modified VGAM therapy. Disease conditions are likely to 
exist, in which a mutation in a binding site of a VGAM 
gene prevents natural VGAM gene to effectively bind in- 
hibit a disease related target-gene, causing up regulation 
of that target gene, and thereby contributing to the dis- 
ease pathology. In such conditions, a modified VGAM 
gene is designed which effectively binds the mutated 
VGAM binding site, i.e. is an effective anti-sense of the 



mutated VGAM binding site, and is introduced in disease 
effected cells. Modified VGAM therapy is preferably 
achieved by transfecting cells with an artificial DNA 
molecule encoding the modified VGAM gene, which 
causes the cells to produce the modified VGAM gene 
product, as is well known in the art. 

[0142] An additional utility of novel genes of the present inven- 
tion is induced cellular differentiation therapy. As aspect 
of the present invention is finding genes which determine 
cellular differentiation, as described hereinabove with ref- 
erence to Fig. 11. Induced cellular differentiation therapy 
comprises transfection of cell with such VGAM genes 
thereby determining their differentiation as desired. It is 
appreciated that this approach may be widely applicable, 
inter alia as a means for auto transplantation harvesting 
cells of one cell-type from a patient, modifying their dif- 
ferentiation as desired, and then transplanting them back 
into the patient. It is further appreciated that this ap- 
proach may also be utilized to modify cell differentiation 
in vivo, by transfecting cells in a genetically diseased tis- 
sue with a cell-differentiation determining VGAM gene, 
thus stimulating these cells to differentiate appropriately. 

[0143] Reference is now made to Figs. 11A and 11B, simplified 



diagrams which when taken together illustrate anti-VGAM 
gene therapy mentioned hereinabove with reference to 
Fig. 10. A utility of novel genes of the present invention is 
anti-VGAM gene therapy, a mode of therapy which allows 
up regulation of a disease related target-gene of a novel 
VGAM gene of the present invention, by lowering levels of 
the novel VGAM gene which naturally inhibits expression 
of that target gene. Fig. 11A shows a normal VGAM gene, 
inhibiting translation of a target gene of VGAM gene, by 
binding to a BINDING SITE found in an untranslated region 
of TARGET RNA, as described hereinabove with reference 
to Fig. 1. 

[0144] pig. 11B shows an example of anti-VGAM gene therapy. 
ANTI-VGAM RNA is short artificial RNA molecule the se- 
quence of which is an anti-sense of VGAM RNA. Anti- 
VGAM treatment comprises transfecting diseased cells 
with ANTI-VGAM RNA, or with a DNA encoding thereof. 
The ANTI-VGAM RNA binds the natural VGAM RNA, 
thereby preventing binding of natural VGAM RNA to its 
BINDING SITE. This prevents natural translation inhibition 
of TARGET RNA by VGAM RNA, thereby up regulating ex- 
pression of TARGET PROTEIN. 

[° 145 ] It is appreciated that anti-VGAM gene therapy is particu- 



larly useful with respect to target genes which have been 
shown to be under-expressed in association with a spe- 
cific disease. 

[0146] Reference is now made to Fig. 12A which is an annotated 
sequence of an EST comprising a novel gene detected by 
the gene detection system of the present invention. Fig. 
12A shows the nucleotide sequence of a known human 
non-protein coding EST (Expressed Sequence Tag), identi- 
fied as EST72223. It is appreciated that the sequence of 
this EST comprises sequences of one known miRNA gene, 
identified as MIR98, and of one novel GAM gene, referred 
to here as GAM24, detected by the bioinformatic gene de- 
tection system of the present invention, described herein- 
above with reference to Fig. 2. 

[0147] Reference is now made to Figs. 12B and 12C that are pic- 
tures of laboratory results, which when taken together 
demonstrate laboratory confirmation of expression of the 
bioinformatically detected novel gene of Fig. 12A. Refer- 
ence is now made to Fig. 12B which is a Northern blot 
analysis of MIR-98 and EST72223 transcripts. MIR-98 and 
EST72223 were reacted with MIR-98 and GAM24 probes 
as indicated in the figure. It is appreciated that the probes 
of both MIR-98 and GAM24 reacted with EST72223, indi- 



eating that EST72223 contains the sequences of MIR-98 
and of GAM24. It is further appreciated that the probe of 
GAM24 does not cross-react with MIR-98. 
[0148] Reference is now made to Fig. 12C. A Northern blot analy- 
sis of EST72223 and MIR-98 transfections were per- 
formed, subsequently marking RNA by the MIR-98 and 
GAM24 probes . Left, Northern reacted with MIR-98, 
Right, Northern reacted with GAM24. The molecular Sizes 
of EST72223, MIR-98 and GAM 2 4 are indicated by arrows. 
Hela are control cells that have not been introduced to 
exogenous RNA. EST and MIR-98 Transfections are RNA 
obtained from Hela transfected with EST72223 and MIR- 
98, respectively. MIR-98 and EST are the transcripts used 
for the transfection experiment. The results indicate that 
EST72223, when transfected into Hela cells, is cut yielding 
known miRNA gene MIR-98 and novel miRNA gene 
GAM24. 

[0149] Reference is now made to Fig. 12D, which is a Northern 
blot of a lisate experiment with MIR-98 and GAM24. 
Northern blot analysis of hairpins in EST72223 . Left, 
Northern reacted with predicted Mir-98 hairpin probe, 
Right, Northern reacted with predicted GAM24 hairpin 
probe. The molecular size of EST Is indicated by arrow. 



The molecular sizes of Mir-98 and GAM24 are 80nt and 
lOOnt, respectively as indicated by arrows. The 22nt 
molecular marker is indicated by arrow. 1-Hela lysate; 
2-EST incubated 4h with Hela lysate; 3-EST without lysate; 

4- Mir transcript incubated 4h with Hela lysate; 5-Mir 
transcript incubated overnight with Hela lysate; 6- Mir 
transcript without lysate; 7-RNA extracted from Hela cells 
following transfection with Mir transcript. 

[0150] Technical methods used in experiments, the results of 
which are depicted in Figs. 12B, 12C and 12D are as fol- 
lows: 

[0151] Transcript preparations: D\gox\gen\n (DIG) labeled transcripts 
were prepared from EST72223 (TIGER), MIR98 and pre- 
dicted precursor hairpins by using a DIG RNA labeling kit 
(Roche Molecular Biochemicals) according to the manufac- 
ture'^ protocol. Briefly, PCR products with T7 promoter at 
the 5" end or T3 promoter at the 3"end were prepared 
from each DNA in order to use it as a template to prepare 
sense and antisense transcripts, respectively. MIR-98 was 
amplified using EST72223 as a templet with T7miR98 for- 
ward primer: 

5- "TAATACGACTCACTATAGGGTGAGGTAGTAAGTTGTATT 
GTT-3"and T3miR98 revse primer: 



S'-AATTAACCCTCACTAAAGGGAAAGTAGTAAGTrGTATAG 
TT-3"EST72223 was amplified with T7-EST 72223 forward 
p r i m e r : 5 "-TAATACG ACTC ACTATAG G CCCTTATTAG AG GAT 
TCTGCT-3"and T3-EST72223 reverse 
primer: 5"-AATTAACCCTCACTAAAGGT I I I I I I TTCCTGAG 
ACAGAGT-3" Bet-4 was amplified using EST72223 as a 
templet with Bet-4 forward primer: 

5"-GAGGCAGGAGAATTGCTTGA- 3" and T3-EST72223 re- 
verse 

p r i m e r : 5 " - AATTAACCCTC ACTAAAG G C CTG AG AC AG AGTCT 
TGCTC-3"The PCR products were cleaned and used for 
DIG-labeled or unlabeled transcription reactions with the 
appropriate polymerase. For transfection experiments, 
CAP reaction was performed by using a mMassage mMa- 
chine kit (Ambion). 
[0152] Transfection procedure: Transfection of Hela cells was per- 
formed by using TransMessenger reagent (Qiagen) ac- 
cording to the manufacture'^ protocol. Briefly, Hela cells 
were seeded to l-2x 10 A 6 cells per plate a day before 
transfection. Two ug RNA transcripts were mixed with 8m I 
Enhancer in a final volume of lOOul, mixed and incubated 
at room temperature for 5 min. 16ul TransMessenger 
reagent was added to the RNA-Enhancer, mixed and incu- 



bated for additional 10 min. Cell plates were washed with 
sterile PBS twice and then incubated with the transfection 
mix diluted with 2.5 ml DMEM medium without serum. 
Cells were incubated with transfection mix for three hours 
under their normal growth condition (370C and 5% C02) 
before the transfection mix was removed and a fresh 
DMEM medium containing serum was added to the cells. 
Cells were left to grow 48 hours before harvesting. 
[0153] Target RNA cleavage assay: Cap-labeled target RNAs were 

TM 

generated using mMessage mMachine (Ambion). Caped 
RNA transcripts were preincubated at 30°C for 15 min in 
supplemented Hela S100 obtained from Computer Cell 
Culture, Mos, Belgium. After addition of all components, 
final concentrations were lOOmM target RNA, lm M ATP, 
0.2mM GTP, lOU/ml RNasin, 30ug/ml creatine kinase, 
25mM creatine phosphate, and 50%S100 extract. Incuba- 
tion was continued for 4 hours to overnight. Cleavage re- 
action was stopped by the addition of 8 volumes of pro- 
teinase K buffer (200Mm Tris-Hcl, pH 7.5, 25m M EDTA, 
300mM NaCI, and 2% SDS). Proteinase K, dissolved in 
50mM Tris-HCI, pH 8, 5m M CaCI2, and 50% glycerol, was 
added to a final concentration of 0.6 mg/ml. Sample were 
subjected to phenol/chlorophorm extraction and kept 



frozen until analyzed by urea-TBE PAGE. 
[0154] Northern analysis: RNAs were extracted from cells by using 
Tri-reagent according to the manufacture'^ protocol. The 
RNAs were dissolved in water and heated to 650C to dis- 
rupt any association of the 25nt RNA with larger RNA 
molecules. RNA were placed on ice and incubated for 30 
min with PEG (MW=8000) in a final concentration of 5% 
and NaCI in a final concentration of 0.5M to precipitate 
high molecular weight nucleic acid. The RNAs were cen- 
trifuged at 10,000xg for 10 min to pellet the high molec- 
ular weight nucleic acid. The supernatant containing the 
low molecular weight RNAs was collected and three vol- 
umes of ethanol was added. The RNAs were placed at - 
200C for at least two hours and then centrifuged at 
10,000xg for 10 min. The pellets were dissolved in Urea- 
TBE buffer (lXtbe, 7M urea) for further analysis by a 
Northern blot. 

[0155] rna samples were boiled for 5 min before loading on 
15%-8% polyacrylamide (19:1) gels containing 7M urea 
and lxTBE. Gels were run in lxTBE at a constant voltage 
of 300V and then transferred into a nylon membrane. The 
membrane was exposed to 3min ultraviolet light to cross 
link the RNAs to the membrane. Hybridization was per- 



formed overnight with DIG-labeled probes at 420C. Mem- 
branes were washed twice with SSCx2 and 0.2% SDS for 10 
min. at 420C and then washed twice with SSCx0.5 for 5 
min at room temperature. The membrane was then devel- 
oped by using a DIG luminescent detection kit (Roche) us- 
ing anti DIG and CSPD reaction, according to the manu- 
facture'^ protocol. 

[0156] it is appreciated that the data presented in Figs. 12A, 12B, 
12C and 12D, when taken together validate the function 
of the bioinformatic gene detection engine 100 of Fig. 2. 
Fig. 12A shows a novel GAM gene bioinformatically de- 
tected by the bioinformatic gene detection engine 100, 
and Figs. 12B, 12C and 12D show laboratory confirmation 
of the expression of this novel gene. This is in accord with 
the engine training and validation methodology described 
hereinabove with reference to Fig. 3. 

[0157] Reference is now made to Fig. 13A which is an annotated 
sequence of an EST comprising a novel gene detected by 
the gene detection system of the present invention. Fig. 
13A shows the nucleotide sequence of a known human 
non-protein coding EST (Expressed Sequence Tag), identi- 
fied as EST 7929020. It is appreciated that the sequence 
of this EST comprises sequences of two novel GAM genes, 



referred to here as GAM23 and GAM25, detected by the 
bioinformatic gene detection system of the present inven- 
tion, described hereinabove with reference to Fig. 2. 

[0158] Reference is now made to Fig. 13B which presents pic- 
tures of laboratory results, that demonstrate laboratory 
confirmation of expression of the bioinformatically de- 
tected novel gene of Fig. 13A. Northern blot analysis of 
hairpins in EST7929020. Left, Northern reacted with pre- 
dicted GAM25 hairpin probe, Right, Northern reacted with 
predicted GAM23 hairpin probe. The molecular size of EST 
is indicated by arrow. The molecular sizes of GAM23 and 
GAM25 are 60nt, as indicated by arrow. The 22nt molecu- 
lar marker is indicated by arrow. 1-Hela lysate; 2- EST in- 
cubated 4h with Hela lysate ; 3- EST incubated overnight 
with Hela lysate; 4-EST without lysate; 5-GAM transcript; 
6- GAM 22nt marker;7-GAM PCR probe; 8-RNA from con- 
trol Hela cells; 9-RNA extracted from Hela cells following 
transfection with EST. 

[0159] Reference is now made to Fig. 13C which is a picture of a 
Northern blot confirming Endogenous expression of 
bioinformatically detected gene GAM25 of Fig. 13A from 
in Hela cells. Northern was reacted with a predicted 
GAM25 hairpin probe. The molecular size of EST7929020 



is indicated. The molecular sizes of GAM25 is 58nt, as in- 
dicated. A 19nt DNA oligo molecular marker is indicated. 
Endogenous expression of GAM25 in Hela total RNA frac- 
tion and in S-100 fraction is indicated by arrows. 
1-GAM25 transcript; 2- GAM 2 5 DNA oligo marker; 3 -RNA 
from control Hela cells; 4-RNA extracted from Hela cells 
following transfection with EST; 5- RNA extracted from S- 
100 Hela lysate. 

[0160] Reference is now made to Fig. 14A which is an annotated 
sequence of an EST comprising a novel gene detected by 
the gene detection system of the present invention. Fig. 
14A shows the nucleotide sequence of a known human 
non-protein coding EST (Expressed Sequence Tag), identi- 
fied as EST 1388749. It is appreciated that the sequence 
of this EST comprises sequence of a novel GAM gene, re- 
ferred to here as GAM26, detected by the bioinformatic 
gene detection system of the present invention, described 
hereinabove with reference to Fig. 2. 

[0161] Reference is now made to Fig. 14B which is a picture of 
Northern blot analysis, confirming expression of novel 
bioinformatically detected gene GAM26, and natural pro- 
cessing thereof from EST1388749. Northern reacted with 
predicted GAM26 hairpin probe. The molecular size of EST 



is indicated by arrow. The molecular sizes of GAM26 is 
130nt, as indicated by arrow. The 22nt molecular marker 
is indicated by arrow. 1-Hela lysate; 2-EST incubated 4h 
with Hela lysate; 3- EST incubated overnight with Hela 
lysate; 4-EST without lysate; 5-GAM transcript; 6- GAM 
22nt marker; 7-GAM PCR probe. 

[0162] pig. 1 further provides a conceptual description of a novel 
bioinformatically detected viral gene of the present inven- 
tion, referred to here as Viral Genomic Address Messenger 
15 (VGAM15) viral gene, which modulates expression of 
respective host target genes thereof, the function and 
utility of which host target genes is known in the art. 

[0163] VGAM15 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM15 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0164] VG AM 1 5 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM15 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0165] VGAM 1 5 gene, herein designated VGAM GENE, encodes a 
VGAM 15 precursor RNA, herein designated VGAM PRE- 



CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM15 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM15 precursor RNA is 
designated SEQID:1, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID: 1 is located at position 7156 relative to the 
genome of Human immunodeficiency virus 1. 

[0166] VGAM 1 5 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 15 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0167] A n enzyme complex designated DICER COMPLEX, dices 
the VGAM15 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 15 RNA, 
herein designated VGAM RNA, a single stranded ~22 nt 



long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 7200%) nucleotide 
sequence of VGAM15 RNA is designated SEQ ID:16, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0168] VG AM 1 5 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 1 5 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 1 5 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0 1 69] VGAM 1 5 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM 15 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM 15 RNA, herein designated VGAM RNA, 



is an accurate or a partial inversed- reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VG AM 1 5 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM15 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 
[0170] The complementary binding of VGAM15 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 1 5 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM15 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM15 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 



therefore outlined by a broken line. 

[° 171 ] It is appreciated that VGAM15 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM15 host target genes. The mRNA of 
each one of this plurality of VGAM15 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM15 RNA, herein designated VGAM 
RNA, and which when bound by VGAM15 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM15 host target proteins. 

[0172] it i S further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM15 gene, herein designated VGAM GENE, on one or 
more VGAM15 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 



of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun C, Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[° 173 ] It is yet further appreciated that a function of VGAM15 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM15 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM15 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM15 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0174] Nucleotide sequences of the VGAM15 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 1 5 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM 15 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM15 are further de- 
scribed hereinbelow with reference to Table 1. 

[0175] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 



Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VG AM 1 5 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0176] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 16 (VGAM16) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0177] VGAM 16 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM 16 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0 17 8] VGAM 16 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM16 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0179] VGAM16 gene, herein designated VGAM GENE, encodes a 
VGAM 16 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM16 precursor RNA, herein des- 



ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM 16 precursor RNA is 
designated SEQID:2, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID:2 is located at position 4668 relative to the 
genome of Human immunodeficiency virus 1. 

[0 1 80] VGAM 16 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 16 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0181] An enzyme complex designated DICER COMPLEX, dices 
the VGAM16 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 16 RNA, 
herein designated VGAM RNA, a single stranded -22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 



~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 7900%) nucleotide 
sequence of VGAM16 RNA is designated SEQ ID: 17, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0182] VGAM16 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 16 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 16 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0 1 83] VGAM 16 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM 16 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM16 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed- reversed sequence of 
the nucleotide sequence of each of the host target binding 



sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM16 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM 16 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0184] The complementary binding of VGAM16 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 16 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM16 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM 16 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[0185] ^ is appreciated that VGAM16 host target gene, herein 



designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM 16 host target genes. The mRNA of 
each one of this plurality of VGAM16 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM16 RNA, herein designated VGAM 
RNA, and which when bound by VGAM16 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM 16 host target proteins. 
[0186] it is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM 16 gene, herein designated VGAM GENE, on one or 
more VGAM 16 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 



genes have not yet been found (Ruvkun G., Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0187] it j S y e t further appreciated that a function of VGAM16 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM16 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM16 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM16 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0188] Nucleotide sequences of the VGAM16 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 16 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM 16 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM 16 are further de- 
scribed hereinbelow with reference to Table 1. 

[0189] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 



to VGAM16 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0190] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 17 (VGAM17) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0191] VGAM 17 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM17 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0192] VGAM 17 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM17 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0193] VGAM 17 gene, herein designated VGAM GENE, encodes a 
VGAM 17 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM17 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 



the nucleotide sequence of VGAM17 precursor RNA is 
designated SEQID:3, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID:3 is located at position 5919 relative to the 
genome of Human immunodeficiency virus 1. 

[0194] VGAM17 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 17 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0195] An enzyme complex designated DICER COMPLEX, dices 
the VGAM17 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 17 RNA, 
herein designated VGAM RNA, a single stranded -22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 



necessary proteins. A probable (over 8400%) nucleotide 
sequence of VGAM17 RNA is designated SEQ ID:18, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0196] VGAM17 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 17 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 17 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0197] VGAM 17 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM 17 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM17 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 



and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VG AM 1 7 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM 17 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0198] The complementary binding of VGAM17 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 17 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM17 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM 17 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[0199] it is appreciated that VGAM17 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM 17 host target genes. The mRNA of 



each one of this plurality of VGAM17 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM17 RNA, herein designated VGAM 
RNA, and which when bound by VGAM17 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM 17 host target proteins. 
[0200] it is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM17 gene, herein designated VGAM GENE, on one or 
more VGAM 17 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun G., Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 



[0201] ^ is yet further appreciated that a function of VGAM17 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM17 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM17 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM17 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0202] Nucleotide sequences of the VGAM17 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 17 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM 17 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM17 are further de- 
scribed hereinbelow with reference to Table 1. 

[0203] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 17 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 



[0204] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 18 (VGAM18) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0205] VGAM18 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM18 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0206] VGAM18 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM18 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0207] VGAM 18 gene, herein designated VGAM GENE, encodes a 
VGAM 18 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM18 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM18 precursor RNA is 
designated SEQ ID:4, and is provided hereinbelow with 



reference to the sequence listing part. Nucleotide se- 
quence SEQ ID:4 is located at position 1459 relative to the 
genome of Human immunodeficiency virus 1. 

[0208] VGAM18 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 18 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0209] An enzyme complex designated DICER COMPLEX, dices 
the VGAM18 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 18 RNA, 
herein designated VGAM RNA, a single stranded -22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 7200%) nucleotide 
sequence of VGAM18 RNA is designated SEQ ID:19, and is 



provided hereinbelow with reference to the sequence list- 
ing part. 

[0210] VGAM18 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 18 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 18 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[° 211 ] VGAM 18 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM 18 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM18 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 



meant as an illustration only, and is not meant to be limit- 
ing VGAM18 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM 18 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0212] The complementary binding of VGAM18 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 18 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM18 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM 18 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[° 213 ] It is appreciated that VGAM 18 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM 18 host target genes. The mRNA of 
each one of this plurality of VGAM 18 host target genes 
comprises one or more host target binding sites, each 



having a nucleotide sequence which is at least partly com- 
plementary to VGAM18 RNA, herein designated VGAM 
RNA, and which when bound by VGAM 18 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM 18 host target proteins. 

[° 214 ] It is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM 18 gene, herein designated VGAM GENE, on one or 
more VGAM 18 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun G., Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0215] | t j S y et further appreciated that a function of VGAM18 is 
inhibition of expression of host target genes, as part of a 



novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM18 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM18 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM18 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0216] Nucleotide sequences of the VGAM18 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 18 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM 18 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM 18 are further de- 
scribed hereinbelow with reference to Table 1. 

[0217] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 18 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0218] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 



invention, referred to here as Viral Genomic Address Mes- 
senger 19 (VGAM19) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0219] VGAM19 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM19 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0220] VGAM19 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM19 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0221] VGAM 19 gene, herein designated VGAM GENE, encodes a 
VGAM 19 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM19 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM 19 precursor RNA is 
designated SEQID:5, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID: 5 is located at position 2168 relative to the 



genome of Human immunodeficiency virus 1. 

[0222] VGAM19 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 19 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0223] An enzyme complex designated DICER COMPLEX, dices 
the VGAM19 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 19 RNA, 
herein designated VGAM RNA, a single stranded -22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 6000%) nucleotide 
sequence of VGAM19 RNA is designated SEQ ID:20, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 



[0 224 ] VGAM19 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 19 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 19 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0225] VGAM 19 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM 19 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM19 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM 19 RNA, herein designated VGAM RNA, may 



have a different number of host target binding sites in 
untranslated regions of a VGAM19 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0226] The complementary binding of VGAM19 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 19 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM19 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM 19 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[° 227 ] It is appreciated that VGAM 19 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM 19 host target genes. The mRNA of 
each one of this plurality of VGAM19 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM19 RNA, herein designated VGAM 



RNA, and which when bound by VGAM19 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM 19 host target proteins. 

[0228] it is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM 19 gene, herein designated VGAM GENE, on one or 
more VGAM 19 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun G., Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0229] | t j S y e t further appreciated that a function of VGAM19 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM19 include diagnosis, prevention and 



treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM19 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM19 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0230] Nucleotide sequences of the VGAM19 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 19 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM 19 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM 19 are further de- 
scribed hereinbelow with reference to Table 1. 

[0231] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 19 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0232] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 20 (VGAM20) viral gene, which modulates expres- 



sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0233] VGAM20 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM20 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0234] VGAM20 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM20 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0235] VGAM20 gene, herein designated VGAM GENE, encodes a 
VGAM 20 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM20 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM20 precursor RNA is 
designated SEQ ID:6, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID:6 is located at position 587 relative to the 
genome of Human immunodeficiency virus 1. 

[0236] VGAM 20 precursor RNA, herein designated VGAM PRE- 



CURSOR RNA, folds onto itself, forming VGAM20 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 
[0237] A n enzyme complex designated DICER COMPLEX, dices 
the VGAM20 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 20 RNA, 
herein designated VGAM RNA, a single stranded ~22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 8400%) nucleotide 
sequence of VGAM20 RNA is designated SEQ ID:21, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0238] VGAM 20 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 



VGAM20 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 20 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0239] VGAM 20 RNA, herein designated VGAM RNA, binds com- 
plementarity to one or more host target binding sites lo- 
cated in untranslated regions of VGAM20 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM20 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM 20 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM20 host target RNA, herein 



designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0240] The complementary binding of VGAM20 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 20 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM20 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM20 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[° 241 ] It is appreciated that VGAM20 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM20 host target genes. The mRNA of 
each one of this plurality of VGAM20 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM20 RNA, herein designated VGAM 
RNA, and which when bound by VGAM20 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 



respective one or more VGAM20 host target proteins. 

[0242] it j S further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM20 gene, herein designated VGAM GENE, on one or 
more VGAM20 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun G., Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0243] | t j S y e t further appreciated that a function of VGAM 20 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM20 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 



VGAM20 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM20 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0244] Nucleotide sequences of the VGAM20 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 20 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM20 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM20 are further de- 
scribed hereinbelow with reference to Table 1. 

[0245] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 20 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0246] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 21 (VGAM21) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 



[0247] VGAM21 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM21 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0248] VGAM21 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM2 1 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0249] VGAM 21 gene, herein designated VGAM GENE, encodes a 
VGAM 21 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM21 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM21 precursor RNA is 
designated SEQID:7, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID:7 is located at position 7857 relative to the 
genome of Human immunodeficiency virus 1. 

[0250] VGAM 21 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 21 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 



SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 
[0251] An enzyme complex designated DICER COMPLEX, dices 
the VGAM21 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM21 RNA, 
herein designated VGAM RNA, a single stranded -22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 7700%) nucleotide 
sequence of VGAM21 RNA is designated SEQ ID:22, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0252] VGAM 21 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 21 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 21 host target RNA, herein designated 



VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0253] VGAM 21 RNA, herein designated VGAM RNA, binds com- 
plementarity to one or more host target binding sites lo- 
cated in untranslated regions of VGAM21 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM21 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM 21 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM21 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 



the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0254] The complementary binding of VGAM21 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM21 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM 21 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM21 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[0255] it is appreciated that VGAM21 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM21 host target genes. The mRNA of 
each one of this plurality of VGAM2 1 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM21 RNA, herein designated VGAM 
RNA, and which when bound by VGAM21 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM21 host target proteins. 

[0256] it is further appreciated by one skilled in the art that the 



mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM21 gene, herein designated VGAM GENE, on one or 
more VGAM21 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun G., Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 
[0257] it j S y e t further appreciated that a function of VGAM2 1 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM21 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM21 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM21 binds and 



inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0258] Nucleotide sequences of the VGAM21 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 21 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM21 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM21 are further de- 
scribed hereinbelow with reference to Table 1. 

[0259] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 21 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0260] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 22 (VGAM22) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0261] VGAM22 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 



method by which VGAM22 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0262] VGAM22 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM22 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0263] VGAM22 gene, herein designated VGAM GENE, encodes a 
VGAM22 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM22 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM22 precursor RNA is 
designated SEQ ID:8, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID:8 is located at position 8292 relative to the 
genome of Human immunodeficiency virus 1. 

[0264] VGAM 2 2 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 2 2 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 



cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 
[0265] An enzyme complex designated DICER COMPLEX, dices 
the VGAM22 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM22 RNA, 
herein designated VGAM RNA, a single stranded ~22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 7600%) nucleotide 
sequence of VGAM22 RNA is designated SEQ ID:23, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0266] VGAM 2 2 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 2 2 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 2 2 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 



lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0267] VGAM22 RNA, herein designated VGAM RNA, binds com- 
plementarity to one or more host target binding sites lo- 
cated in untranslated regions of VGAM22 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM22 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM22 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM22 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 



gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0268] The complementary binding of VGAM22 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM22 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM22 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM22 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[0269] it is appreciated that VGAM22 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM 2 2 host target genes. The mRNA of 
each one of this plurality of VGAM22 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM22 RNA, herein designated VGAM 
RNA, and which when bound by VGAM22 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM22 host target proteins. 

[0270] it is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 



VGAM22 gene, herein designated VGAM GENE, on one or 
more VGAM22 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun G., Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 
[° 271 ] It is yet further appreciated that a function of VGAM 2 2 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM22 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM22 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM22 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 



[0272] Nucleotide sequences of the VGAM22 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 2 2 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM22 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM22 are further de- 
scribed hereinbelow with reference to Table 1. 

[0273] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 2 2 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0274] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 23 (VGAM23) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0275] VGAM23 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM23 was detected is described 
hereinabove with reference to Figs. 2-8. 



[0 27 6] VGAM23 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM23 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0277] VGAM 2 3 gene, herein designated VGAM GENE, encodes a 
VGAM 2 3 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM23 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM23 precursor RNA is 
designated SEQID:9, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID:9 is located at position 5531 relative to the 
genome of Human immunodeficiency virus 1. 

[0 27 8] VGAM 2 3 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 2 3 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 



RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 
[0279] An enzyme complex designated DICER COMPLEX, dices 
the VGAM23 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM23 RNA, 
herein designated VGAM RNA, a single stranded -22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 6900%) nucleotide 
sequence of VGAM23 RNA is designated SEQ ID:24, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0280] VGAM 2 3 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 2 3 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 2 3 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 



spectively. 

[028 1 ] VGAM23 RNA, herein designated VGAM RNA, binds com- 
plementarity to one or more host target binding sites lo- 
cated in untranslated regions of VGAM23 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM23 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM 2 3 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM23 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0282] The complementary binding of VGAM23 RNA, herein des- 



ignated VGAM RNA, to host target binding sites on 
VGAM23 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM 2 3 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM23 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[0283] it is appreciated that VGAM23 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM23 host target genes. The mRNA of 
each one of this plurality of VGAM23 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM23 RNA, herein designated VGAM 
RNA, and which when bound by VGAM23 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM23 host target proteins. 

[0284] | t j S further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM23 gene, herein designated VGAM GENE, on one or 
more VGAM23 host target gene, herein designated VGAM 



HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun G., Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0285] it is yet further appreciated that a function of VGAM23 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM23 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM23 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM23 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0286] Nucleotide sequences of the VGAM23 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 



diced VGAM23 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM23 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM23 are further de- 
scribed hereinbelow with reference to Table 1. 

[0287] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 2 3 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0288] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 24 (VGAM24) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0289] VGAM24 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM24 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0290] VGAM 2 4 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 



ciency virus 1. VGAM24 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0291] VGAM24 gene, herein designated VGAM GENE, encodes a 
VGAM 2 4 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM24 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM24 precursor RNA is 
designated SEQID:10, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID: 10 is located at position 1301 relative to 
the genome of Human immunodeficiency virus 1. 

[0292] VGAM 2 4 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 2 4 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 



the second half thereof. 
[0293] An enzyme complex designated DICER COMPLEX, dices 
the VGAM24 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM24 RNA, 
herein designated VGAM RNA, a single stranded -22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 8700%) nucleotide 
sequence of VGAM24 RNA is designated SEQ ID:25, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0294] VGAM 2 4 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 2 4 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 2 4 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0295] VGAM 2 4 RNA, herein designated VGAM RNA, binds com- 



plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM24 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM24 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM 2 4 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM24 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 
[0296] The complementary binding of VGAM24 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 2 4 host target RNA, herein designated VGAM HOST 



TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM24 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM24 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[° 297 ] It is appreciated that VGAM24 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM24 host target genes. The mRNA of 
each one of this plurality of VGAM24 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM24 RNA, herein designated VGAM 
RNA, and which when bound by VGAM24 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM24 host target proteins. 

[0298] it is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM24 gene, herein designated VGAM GENE, on one or 
more VGAM24 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 



reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun C, Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0299] | t j S y e t further appreciated that a function of VGAM24 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM24 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM24 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM24 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0300] Nucleotide sequences of the VGAM24 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 2 4 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 



VGAM24 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM 2 4 are further de- 
scribed hereinbelow with reference to Table 1. 

[0301] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 2 4 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0302] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 25 (VGAM25) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0303] VGAM25 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM25 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0304] VGAM 2 5 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM25 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 



tained in the human genome. 

[0305] VGAM25 gene, herein designated VGAM GENE, encodes a 
VGAM25 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM25 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM25 precursor RNA is 
designated SEQID:11, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID: 11 is located at position 9080 relative to 
the genome of Human immunodeficiency virus 1. 

[0306] VGAM 2 5 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 2 5 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0307] An enzyme complex designated DICER COMPLEX, dices 



the VGAM25 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM25 RNA, 
herein designated VGAM RNA, a single stranded ~22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 9000%) nucleotide 
sequence of VGAM25 RNA is designated SEQ ID:26, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0308] VGAM 2 5 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 2 5 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 2 5 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0309] VGAM 2 5 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM25 host target RNA, 



herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM25 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM 2 5 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM25 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 
[0310] The complementary binding of VGAM25 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 2 5 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM25 host tar- 



get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM25 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[° 311 ] It is appreciated that VGAM25 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM25 host target genes. The mRNA of 
each one of this plurality ofVGAM25 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM25 RNA, herein designated VGAM 
RNA, and which when bound by VGAM25 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM25 host target proteins. 

[° 312 ] It is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM25 gene, herein designated VGAM GENE, on one or 
more VGAM25 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 



for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun C, Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0313] it is yet further appreciated that a function of VGAM25 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM25 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM25 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM25 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0314] Nucleotide sequences of the VGAM25 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 2 5 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM25 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM25 are further de- 



scribed hereinbelow with reference to Table 1. 

[0315] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM25 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0316] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 26 (VGAM26) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0317] VGAM26 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM26 was detected is described 
hereinabove with reference to Figs. 2-8. 

[03 1 8] VGAM 2 6 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM26 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0319] VGAM26 gene, herein designated VGAM GENE, encodes a 



VGAM26 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VCAM26 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM26 precursor RNA is 
designated SEQID:12, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID: 12 is located at position 2049 relative to 
the genome of Human immunodeficiency virus 1. 

[0320] VGAM 2 6 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 2 6 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0321] An enzyme complex designated DICER COMPLEX, dices 
the VGAM26 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 2 6 RNA, 



herein designated VGAM RNA, a single stranded ~22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 7200%) nucleotide 
sequence of VGAM26 RNA is designated SEQ ID:27, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0322] VGAM 2 6 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 2 6 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 2 6 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0323] VGAM 2 6 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM26 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 



sequence of VGAM26 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM26 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM26 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 
[0324] The complementary binding of VGAM26 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 2 6 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM26 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM26 host target protein, herein designated VGAM 



HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[0325] it is appreciated that VGAM26 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM26 host target genes. The mRNA of 
each one of this plurality of VGAM26 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM26 RNA, herein designated VGAM 
RNA, and which when bound by VGAM26 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM26 host target proteins. 

[0326] it is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM26 gene, herein designated VGAM GENE, on one or 
more VGAM26 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 



believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun C, Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0327] | t j S yet further appreciated that a function of VGAM26 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM26 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM26 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM26 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0328] Nucleotide sequences of the VGAM26 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 2 6 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM26 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM26 are further de- 
scribed hereinbelow with reference to Table 1. 

[0329] Nucleotide sequences of host target binding sites, such as 



BINDING SITE-I, BINDING SITE-II and BINDING SITE-IN of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM26 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0330] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 27 (VGAM27) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0331] VGAM27 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM27 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0332] VGAM27 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM27 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0333] VGAM27 gene, herein designated VGAM GENE, encodes a 
VGAM27 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 



most ordinary genes, VGAM27 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VCAM27 precursor RNA is 
designated SEQID:13, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID: 13 is located at position 1810 relative to 
the genome of Human immunodeficiency virus 1. 

[0334] VGAM 2 7 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 2 7 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0335] An enzyme complex designated DICER COMPLEX, dices 
the VGAM27 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 2 7 RNA, 
herein designated VGAM RNA, a single stranded ~22 nt 
long RNA segment. As is known in the art, dicing of a 



hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 6000%) nucleotide 
sequence of VGAM27 RNA is designated SEQ ID:28, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0336] VGAM27 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 2 7 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 2 7 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0337] VGAM 2 7 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM27 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM27 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 



the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM27 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM27 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 
[0338] The complementary binding of VGAM27 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 2 7 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM27 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM27 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 



[0339] ^ is appreciated that VGAM27 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM27 host target genes. The mRNA of 
each one of this plurality of VGAM27 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM27 RNA, herein designated VGAM 
RNA, and which when bound by VGAM27 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM27 host target proteins. 

[0340] it is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM27 gene, herein designated VGAM GENE, on one or 
more VGAM27 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 



cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun C, Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[° 341 ] It is yet further appreciated that a function of VGAM27 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM27 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM27 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM27 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0342] Nucleotide sequences of the VGAM27 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 2 7 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM27 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM27 are further de- 
scribed hereinbelow with reference to Table 1. 

[0343] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-IN of 
Fig. 1, found on, and schematic representation of the 



complementarity of each of these host target binding sites 
to VGAM27 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0344] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 28 (VGAM28) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0345] VGAM28 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM 2 8 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0346] VGAM 2 8 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM28 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0347] VGAM 2 8 gene, herein designated VGAM GENE, encodes a 
VGAM 2 8 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM28 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 



tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VCAM28 precursor RNA is 
designated SEQID:14, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID: 14 is located at position 728 relative to the 
genome of Human immunodeficiency virus 1. 

[0348] VGAM28 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 2 8 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0349] An enzyme complex designated DICER COMPLEX, dices 
the VGAM28 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 2 8 RNA, 
herein designated VGAM RNA, a single stranded -22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 



comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 7600%) nucleotide 
sequence of VGAM28 RNA is designated SEQ ID:29, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0350] VGAM28 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 2 8 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 2 8 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0351] VGAM 2 8 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM28 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM28 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 



binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 
number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM28 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM28 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0352] The complementary binding of VGAM28 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 2 8 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM28 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM28 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[0353] it is appreciated that VGAM28 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 



a plurality of VGAM28 host target genes. The mRNA of 
each one of this plurality of VCAM28 host target genes 
comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM28 RNA, herein designated VGAM 
RNA, and which when bound by VGAM28 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM28 host target proteins. 
[0354] it j S further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM28 gene, herein designated VGAM GENE, on one or 
more VGAM28 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun G., Perspective: 



Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0355] | t j S y et further appreciated that a function of VGAM28 is 
inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM28 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM28 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM28 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0356] Nucleotide sequences of the VGAM28 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 2 8 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM28 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM28 are further de- 
scribed hereinbelow with reference to Table 1. 

[0357] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 2 8 RNA, herein designated VGAM RNA, are de- 



scribed hereinbelow with reference to Table 2. 

[0358] pig. 1 further provides a conceptual description of another 
novel bioinformatically detected viral gene of the present 
invention, referred to here as Viral Genomic Address Mes- 
senger 29 (VGAM29) viral gene, which modulates expres- 
sion of respective host target genes thereof, the function 
and utility of which host target genes is known in the art. 

[0359] VGAM29 is a novel bioinformatically detected regulatory, 
non protein coding, viral micro RNA (miRNA) gene. The 
method by which VGAM29 was detected is described 
hereinabove with reference to Figs. 2-8. 

[0360] VGAM29 gene, herein designated VGAM GENE, is a viral 
gene contained in the genome of Human immunodefi- 
ciency virus 1. VGAM29 host target gene, herein desig- 
nated VGAM HOST TARGET GENE, is a human gene con- 
tained in the human genome. 

[0361] VGAM29 gene, herein designated VGAM GENE, encodes a 
VGAM 2 9 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA. Similar to other miRNA genes, and unlike 
most ordinary genes, VGAM29 precursor RNA, herein des- 
ignated VGAM PRECURSOR RNA, does not encode a pro- 
tein. A nucleotide sequence identical or highly similar to 
the nucleotide sequence of VGAM29 precursor RNA is 



designated SEQID:15, and is provided hereinbelow with 
reference to the sequence listing part. Nucleotide se- 
quence SEQ ID: 15 is located at position 5471 relative to 
the genome of Human immunodeficiency virus 1. 

[0362] VGAM29 precursor RNA, herein designated VGAM PRE- 
CURSOR RNA, folds onto itself, forming VGAM 2 9 folded 
precursor RNA, herein designated VGAM FOLDED PRECUR- 
SOR RNA, which has a two-dimensional hairpin structure. 
As is well known in the art, this hairpin structure, is typi- 
cal of RNA encoded by miRNA genes, and is due to the 
fact that the nucleotide sequence of the first half of the 
RNA encoded by a miRNA gene is an accurate or partial 
inversed-reversed sequence of the nucleotide sequence of 
the second half thereof. 

[0363] An enzyme complex designated DICER COMPLEX, dices 
the VGAM29 folded precursor RNA, herein designated 
VGAM FOLDED PRECURSOR RNA, into VGAM 2 9 RNA, 
herein designated VGAM RNA, a single stranded -22 nt 
long RNA segment. As is known in the art, dicing of a 
hairpin structured RNA precursor product into a short 
~22nt RNA segment is catalyzed by an enzyme complex 
comprising an enzyme called Dicer together with other 
necessary proteins. A probable (over 7000%) nucleotide 



sequence of VGAM29 RNA is designated SEQ ID:30, and is 
provided hereinbelow with reference to the sequence list- 
ing part. 

[0364] VGAM29 host target gene, herein designated VGAM HOST 
TARGET GENE, encodes a corresponding messenger RNA, 
VGAM 2 9 host target RNA, herein designated VGAM HOST 
TARGET RNA. VGAM 2 9 host target RNA, herein designated 
VGAM HOST TARGET RNA, comprises three regions, as is 
typical of mRNA of a protein coding gene: a 5 untrans- 
lated region, a protein coding region and a 3 untranslated 
region, designated 5UTR, PROTEIN CODING and 3UTR re- 
spectively. 

[0365] VGAM 2 9 RNA, herein designated VGAM RNA, binds com- 
plementarily to one or more host target binding sites lo- 
cated in untranslated regions of VGAM29 host target RNA, 
herein designated VGAM HOST TARGET RNA. This com- 
plementary binding is due to the fact that the nucleotide 
sequence of VGAM29 RNA, herein designated VGAM RNA, 
is an accurate or a partial inversed-reversed sequence of 
the nucleotide sequence of each of the host target binding 
sites. As an illustration, Fig. 1 shows 3 such host target 
binding sites, designated BINDING SITE I, BINDING SITE II 
and BINDING SITE III respectively. It is appreciated that the 



number of host target binding sites shown in Fig. 1 is 
meant as an illustration only, and is not meant to be limit- 
ing VGAM29 RNA, herein designated VGAM RNA, may 
have a different number of host target binding sites in 
untranslated regions of a VGAM29 host target RNA, herein 
designated VGAM HOST TARGET RNA. It is further appre- 
ciated that while Fig. 1 depicts host target binding sites in 
the 3UTR region, this is meant as an example only these 
host target binding sites may be located in the 3UTR re- 
gion, the 5UTR region, or in both 3UTR and 5UTR regions. 

[0366] The complementary binding of VGAM29 RNA, herein des- 
ignated VGAM RNA, to host target binding sites on 
VGAM 2 9 host target RNA, herein designated VGAM HOST 
TARGET RNA, such as BINDING SITE I, BINDING SITE II and 
BINDING SITE III, inhibits translation of VGAM29 host tar- 
get RNA, herein designated VGAM HOST TARGET RNA, 
into VGAM29 host target protein, herein designated VGAM 
HOST TARGET PROTEIN. VGAM host target protein is 
therefore outlined by a broken line. 

[0367] | t j S appreciated that VGAM29 host target gene, herein 

designated VGAM HOST TARGET GENE, in fact represents 
a plurality of VGAM29 host target genes. The mRNA of 
each one of this plurality of VGAM29 host target genes 



comprises one or more host target binding sites, each 
having a nucleotide sequence which is at least partly com- 
plementary to VGAM29 RNA, herein designated VGAM 
RNA, and which when bound by VGAM29 RNA, herein 
designated VGAM RNA, causes inhibition of translation of 
respective one or more VGAM29 host target proteins. 

[0368] it is further appreciated by one skilled in the art that the 
mode of translational inhibition illustrated by Fig. 1 with 
specific reference to translational inhibition exerted by 
VGAM29 gene, herein designated VGAM GENE, on one or 
more VGAM29 host target gene, herein designated VGAM 
HOST TARGET GENE, is in fact common to other known 
non-viral miRNA genes. As mentioned hereinabove with 
reference to the background section, although a specific 
complementary binding site has been demonstrated only 
for some of the known miRNA genes (primarily Lin-4 and 
Let- 7), all other recently discovered miRNA genes are also 
believed by those skilled in the art to modulate expression 
of other genes by complementary binding, although spe- 
cific complementary binding sites of these other miRNA 
genes have not yet been found (Ruvkun G., Perspective: 
Glimpses of a tiny RNA world, Science 294,779 (2001)). 

[0369] it is yet further appreciated that a function of VGAM29 is 



inhibition of expression of host target genes, as part of a 
novel viral mechanism of attacking a host. Accordingly, 
utilities of VGAM29 include diagnosis, prevention and 
treatment of viral infection by Human immunodeficiency 
virus 1. Specific functions, and accordingly utilities, of 
VGAM29 correlate with, and may be deduced from, the 
identity of the host target genes which VGAM29 binds and 
inhibits, and the function of these host target genes, as 
elaborated hereinbelow. 

[0370] Nucleotide sequences of the VGAM29 precursor RNA, 
herein designated VGAM PRECURSOR RNA, and of the 
diced VGAM 2 9 RNA, herein designated VGAM RNA, and a 
schematic representation of the secondary folding of 
VGAM29 folded precursor RNA, herein designated VGAM 
FOLDED PRECURSOR RNA, of VGAM29 are further de- 
scribed hereinbelow with reference to Table 1. 

[0371] Nucleotide sequences of host target binding sites, such as 
BINDING SITE-I, BINDING SITE-II and BINDING SITE-MI of 
Fig. 1, found on, and schematic representation of the 
complementarity of each of these host target binding sites 
to VGAM 2 9 RNA, herein designated VGAM RNA, are de- 
scribed hereinbelow with reference to Table 2. 

[0372] pig. 9 further provides a conceptual description of novel 



bioinformatically detected regulatory viral gene, referred 
to here as Viral Genomic Record 30(VGR30) viral gene, 
which encodes an operon-like cluster of novel viral micro 
RNA-like genes, each of which in turn modulates expres- 
sion of at least one host target gene, the function and 
utility of which at least one host target gene is known in 
the art. 

[0373] VGR30 gene, herein designated VGR GENE, is a novel 

bioinformatically detected regulatory, non protein coding, 
RNA viral gene. The method by which VGR30 gene was 
detected is described hereinabove with reference to Figs. 
6-15. 

[0374] VGR30 gene encodes VGR30 precursor RNA, herein desig- 
nated VGR PRECURSOR RNA, an RNA molecule, typically 
several hundred nucleotides long. 

[0375] VGR30 precursor RNA folds spatially, forming VGR30 
folded precursor RNA, herein designated VGR FOLDED 
PRECURSOR RNA. It is appreciated that VGR30 folded pre- 
cursor RNA, herein designated VGR FOLDED PRECURSOR 
RNA, comprises a plurality of what is known in the art as 
hairpin structures. These hairpin structures are due to the 
fact that the nucleotide sequence of VGR30 precursor RNA 
comprises a plurality of segments, the first half of each 



such segment having a nucleotide sequence which is at 
least a partial inversed-reversed sequence of the second 
half thereof, as is well known in the art. 

[0376] VGR30 folded precursor RNA, herein designated VGR 

FOLDED PRECURSOR RNA, is naturally processed by cellu- 
lar enzymatic activity into at least 8 separate VGAM pre- 
cursor RNAs, VGAM 1 5 precursor RNA, VGAM 16 precursor 
RNA, VGAM 17 precursor RNA, VGAM 18 precursor RNA, 
VGAM 19 precursor RNA, VGAM 20 precursor RNA, 
VGAM 21 precursor RNA and VGAM 2 2 precursor RNA, 
herein schematically represented by VGAM1 PRECURSOR, 
VGAM 2 PRECURSOR, VGAM 3 PRECURSOR, VGAM4 PRE- 
CURSOR, VGAM 5 PRECURSOR, VGAM6 PRECURSOR, 
VGAM 7 PRECURSOR and VGAM 8 PRECURSOR respectively, 
each of which VGAM precursor RNAs being a hairpin 
shaped RNA segment, corresponding to VGAM PRECUR- 
SOR RNA of Fig. 8. 

[0377] The above mentioned VGAM precursor RNAs are diced by 
DICER COMPLEX of Fig. 8, yielding respective short RNA 
segments of about 22 nucleotides in length, VGAM15 
RNA, VGAM 16 RNA, VGAM 17 RNA, VGAM 18 RNA, VGAM 19 
RNA, VGAM 20 RNA, VGAM21 RNA and VGAM 2 2 RNA re- 
spectively, herein schematically represented by VGAM1 



RNA, VGAM2 RNA, VGAM3 RNA, VGAM4 RNA, VGAM5 
RNA, VGAM6 RNA, VGAM7 RNA and VGAM8 RNA respec- 
tively, each of which VGAM RNAs corresponding to VGAM 
RNA of Fig. 8. 

[0378] VGAM 15 RNA, herein schematically represented by 

VGAM1 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM 15 host 
target RNA, herein schematically represented by VGAM1 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM15 host target RNA, herein 
schematically represented by VGAM1 HOST TARGET RNA 
into VGAM15 host target protein, herein schematically 
represented by VGAM 1 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0379] VGAM 16 RNA, herein schematically represented by 

VGAM2 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM 16 host 
target RNA, herein schematically represented by VGAM2 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 



hibiting translation of VGAM16 host target RNA, herein 
schematically represented by VGAM2 HOST TARGET RNA 
into VGAM16 host target protein, herein schematically 
represented by VGAM2 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0380] VGAM17 RNA, herein schematically represented by 

VGAM3 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM17 host 
target RNA, herein schematically represented by VGAM3 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM17 host target RNA, herein 
schematically represented by VGAM3 HOST TARGET RNA 
into VGAM17 host target protein, herein schematically 
represented by VGAM3 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0381] VGAM18 RNA, herein schematically represented by 

VGAM4 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM18 host 
target RNA, herein schematically represented by VGAM4 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 



I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM18 host target RNA, herein 
schematically represented by VGAM4 HOST TARGET RNA 
into VGAM18 host target protein, herein schematically 
represented by VGAM4 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0382] VGAM19 RNA, herein schematically represented by 

VGAM5 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM19 host 
target RNA, herein schematically represented by VGAM5 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM19 host target RNA, herein 
schematically represented by VGAM5 HOST TARGET RNA 
into VGAM19 host target protein, herein schematically 
represented by VG AM 5 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0383] VGAM20 RNA, herein schematically represented by 

VGAM6 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM20 host 
target RNA, herein schematically represented by VGAM6 
HOST TARGET RNA, which host target binding site corre- 



sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM20 host target RNA, herein 
schematically represented by VGAM6 HOST TARGET RNA 
into VGAM20 host target protein, herein schematically 
represented by VGAM6 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0384] VGAM21 RNA, herein schematically represented by 

VGAM7 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM21 host 
target RNA, herein schematically represented by VGAM7 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM21 host target RNA, herein 
schematically represented by VGAM7 HOST TARGET RNA 
into VGAM21 host target protein, herein schematically 
represented by VGAM7 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0385] VGAM22 RNA, herein schematically represented by 

VGAM8 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM22 host 
target RNA, herein schematically represented by VGAM8 



HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM22 host target RNA, herein 
schematically represented by VGAM8 HOST TARGET RNA 
into VGAM22 host target protein, herein schematically 
represented by VGAM8 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0386] it is appreciated that a function of VGR30 gene, herein 

designated VGR GENE, is inhibition of expression of host 
target genes, as part of a novel viral mechanism of attack- 
ing a host. Accordingly, utilities of VGR30 gene include 
diagnosis, prevention and treatment of viral infection by . 
Specific functions, and accordingly utilities, of VGR30 
gene, herein designated VGR GENE, correlate with, and 
may be deduced from, the identity of the host target 
genes, which are inhibited by VGAM RNAs comprised in 
the operon-like cluster of VGR30 gene: VGAM15 host tar- 
get protein, VGAM 16 host target protein, VGAM 17 host 
target protein, VGAM 18 host target protein, VGAM 19 host 
target protein, VGAM20 host target protein, VGAM21 host 
target protein and VGAM22 host target protein, herein 
schematically represented by VGAM1 HOST TARGET PRO- 



TEIN through VGAM HOST TARGET PROTEIN respectively. 
The function of these host target genes is elaborated 
hereinabove with reference to VGAM 15, VGAM 16, 
VGAM 17, VGAM 18, VGAM 19, VGAM 20, VGAM 21 and 
VGAM 2 2 

[0387] pig. 9 further provides a conceptual description of novel 
bioinformatically detected regulatory viral gene, referred 
to here as Viral Genomic Record 31(VGR31) viral gene, 
which encodes an operon-like cluster of novel viral micro 
RNA-like genes, each of which in turn modulates expres- 
sion of at least one host target gene, the function and 
utility of which at least one host target gene is known in 
the art. 

[0388] VGR31 gene, herein designated VGR GENE, is a novel 

bioinformatically detected regulatory, non protein coding, 
RNA viral gene. The method by which VGR31 gene was 
detected is described hereinabove with reference to Figs. 
6-15. 

[0389] VGR31 gene encodes VGR31 precursor RNA, herein desig- 
nated VGR PRECURSOR RNA, an RNA molecule, typically 
several hundred nucleotides long. 

[0390] VGR31 precursor RNA folds spatially, forming VGR31 
folded precursor RNA, herein designated VGR FOLDED 



PRECURSOR RNA. It is appreciated that VGR31 folded pre- 
cursor RNA, herein designated VGR FOLDED PRECURSOR 
RNA, comprises a plurality of what is known in the art as 
hairpin structures. These hairpin structures are due to the 
fact that the nucleotide sequence of VGR31 precursor RNA 
comprises a plurality of segments, the first half of each 
such segment having a nucleotide sequence which is at 
least a partial inversed-reversed sequence of the second 
half thereof, as is well known in the art. 

[0391] VGR31 folded precursor RNA, herein designated VGR 

FOLDED PRECURSOR RNA, is naturally processed by cellu- 
lar enzymatic activity into at least 7 separate VGAM pre- 
cursor RNAs, VGAM 2 3 precursor RNA, VGAM 2 4 precursor 
RNA, VGAM 2 5 precursor RNA, VGAM 2 6 precursor RNA, 
VGAM 2 7 precursor RNA, VGAM 2 8 precursor RNA and 
VGAM29 precursor RNA, herein schematically represented 
by VGAM1 PRECURSOR, VGAM 2 PRECURSOR, VGAM 3 PRE- 
CURSOR, VGAM4 PRECURSOR, VGAM 5 PRECURSOR, 
VGAM 6 PRECURSOR and VGAM 7 PRECURSOR respectively, 
each of which VGAM precursor RNAs being a hairpin 
shaped RNA segment, corresponding to VGAM PRECUR- 
SOR RNA of Fig. 8. 

[0392] The above mentioned VGAM precursor RNAs are diced by 



DICER COMPLEX of Fig. 8, yielding respective short RNA 
segments of about 22 nucleotides in length, VGAM23 
RNA, VGAM24 RNA, VGAM25 RNA, VGAM26 RNA, VGAM27 
RNA, VGAM28 RNA and VGAM29 RNA respectively, herein 
schematically represented by VGAM1 RNA, VGAM2 RNA, 
VGAM3 RNA, VGAM4 RNA, VGAM5 RNA, VGAM6 RNA and 
VGAM7 RNA respectively, each of which VGAM RNAs cor- 
responding to VGAM RNA of Fig. 8. 
[0393] VGAM23 RNA, herein schematically represented by 

VGAM1 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM23 host 
target RNA, herein schematically represented by VGAM1 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM23 host target RNA, herein 
schematically represented by VGAM1 HOST TARGET RNA 
into VGAM23 host target protein, herein schematically 
represented by VGAM 1 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0394] VGAM24 RNA, herein schematically represented by 

VGAM2 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM24 host 



target RNA, herein schematically represented by VGAM2 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM24 host target RNA, herein 
schematically represented by VGAM2 HOST TARGET RNA 
into VGAM24 host target protein, herein schematically 
represented by VGAM2 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0395] VGAM25 RNA, herein schematically represented by 

VGAM3 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM25 host 
target RNA, herein schematically represented by VGAM3 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM25 host target RNA, herein 
schematically represented by VGAM3 HOST TARGET RNA 
into VGAM25 host target protein, herein schematically 
represented by VGAM3 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0396] VGAM26 RNA, herein schematically represented by 

VGAM4 binds complimentarily to a host target binding 



site located in an untranslated region of VGAM26 host 
target RNA, herein schematically represented by VGAM4 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM26 host target RNA, herein 
schematically represented by VGAM4 HOST TARGET RNA 
into VGAM26 host target protein, herein schematically 
represented by VGAM4 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0397] VGAM27 RNA, herein schematically represented by 

VGAM5 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM27 host 
target RNA, herein schematically represented by VGAM5 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM27 host target RNA, herein 
schematically represented by VGAM5 HOST TARGET RNA 
into VGAM27 host target protein, herein schematically 
represented by VG AM 5 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0398] VGAM28 RNA, herein schematically represented by 



VGAM6 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM28 host 
target RNA, herein schematically represented by VGAM6 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM28 host target RNA, herein 
schematically represented by VGAM6 HOST TARGET RNA 
into VGAM28 host target protein, herein schematically 
represented by VGAM6 HOST TARGET PROTEIN, both of 
Fig. 1. 

[0399] VGAM29 RNA, herein schematically represented by 

VGAM7 binds complimentarily to a host target binding 
site located in an untranslated region of VGAM29 host 
target RNA, herein schematically represented by VGAM7 
HOST TARGET RNA, which host target binding site corre- 
sponds to a host target binding site such as BINDING SITE 
I, BINDING SITE II or BINDING SITE III of Fig. 1, thereby in- 
hibiting translation of VGAM29 host target RNA, herein 
schematically represented by VGAM7 HOST TARGET RNA 
into VGAM29 host target protein, herein schematically 
represented by VGAM7 HOST TARGET PROTEIN, both of 
Fig. 1. 



[0400] ^ is appreciated that a function of VGR31 gene, herein 

designated VGR GENE, is inhibition of expression of host 
target genes, as part of a novel viral mechanism of attack- 
ing a host. Accordingly, utilities of VGR31 gene include 
diagnosis, prevention and treatment of viral infection by . 
Specific functions, and accordingly utilities, of VGR31 
gene, herein designated VGR GENE, correlate with, and 
may be deduced from, the identity of the host target 
genes, which are inhibited by VGAM RNAs comprised in 
the operon-like cluster of VGR31 gene: VGAM23 host tar- 
get protein, VGAM24 host target protein, VGAM25 host 
target protein, VGAM26 host target protein, VGAM27 host 
target protein, VGAM 2 8 host target protein and VGAM 2 9 
host target protein, herein schematically represented by 
VGAM1 HOST TARGET PROTEIN through VGAM HOST TAR- 
GET PROTEIN respectively. The function of these host tar- 
get genes is elaborated hereinabove with reference to 
VGAM23, VGAM24, VGAM25, VGAM 2 6, VGAM27, VGAM 2 8 
and VGAM 2 9 
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